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SKIN-FOLD CHANGES WITH INCREASING OBESITY 
IN YOUNG AMERICAN MALES * 


BY RUSSELL W. NEWMAN 


Quartermaster Research and Development Center 
Natick, Massachusetts 


INTRODUCTION 


URING a recent anthropometric examination of approximately 
2,000 young American men, soon after induction into the United 


States Army and before basic training, skin-fcld measurements were 
obtained. The availability of skin-fold data on such a large series and 
yet one which was remarkably homogeneous in terms of age and physical 
selectivity makes it possible to conduct statistical comparisons of sub- 
samples within this group with reasonable validity. The basic descrip- 
tion of the skin-fold and related measurements and their demographic 
distribution will be treated in another report. Two problems are 
examined here: (1) the question of utilizing skin-fold measurements 


* Presented at the annual meeting of the American Association of Physical 
Anthropologists, Yellow Springs, Ohio, March 28, 1954. 
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obtained on American Negroes to calculate their body fat; and (2) 
whether data on large numbers of men of varying degrees of obesity shed 
any further light on the relationship between subcutaneous fat and 
total obesity. 

It is not the purpose of this paper to review the history and theory 
of skin-fold measurements as indicators of obesity; this has been covered 
in a series of recent review articles (BroZek and Keys, ’51; BroZek, 53; 
Keys and BroZek, 53). However, a brief statement of some of the 
knowns and unknowns is pertinent to the problems presented here. The 
most interesting aspect of subcutaneous fat as measured by skin-fold 
thickness is in its relationship with a calculation of total body fat 
obtained by densiometry. Essentially, this technique consists of a close 
statistical association between the skin-fold thickness at selected sites 
and body fat (as a percent of weight) calculated from the average 
density (weight divided by body volume). Brozek and Keys (’51) have 
demonstrated such a close relationship for 133 University of Minnesota 
college men that a multiple regression equation based on the skin-fold 
thickness at the chest, arm, and abdomen can be used to calculate body 
fat with a high degree of reliability. There are a number of limitations 
inherent in the use of this equation, only two cf which will be discussed 
here: (1) since this equation was based on White college students, can 
it be applied equally to other racial groups, in this case American 
Negroes? (2) since it is a single multiple regression equation which 
fits all young men whether lean or obese, it assumes that the skin-fold 


thicknesses at all sites represented in the equation increase linearly with 


increasing obesity. Is this true when a large sample is analyzed? Un- 
fortunately, one vital portion of the data on the Army men, a reliable 
and independent measure of body volume from which average density 
could be calculated, was not available. None of the densiometric 
techniques now available or expected in the near future (e. g., under- 
water weighing, air displacement, gas dilution), could handle such large 
numbers of men in the time available. This lack of an independent 
method of caleulating body fat limits the analyses to the observed data, 
skin-fold thickness, body fat calculated by the Brozek and Keys’ equation, 
and total hody weight. 

Some of the pertinent elements of the measuring technique are 
reported here because the increasing number of measuring instruments 
and skin-fold sites in use might otherwise eventually result in the 


grouping of basically inapplicable data. Five skin-fold sites were 
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measured on these subjects: chest—above and to the right of the right 
nipple; arm—on the back side, half-way up the upper arm; back— 
immediately below the right scapula; abdomen—to the left of the navel; 
and knee—above the right kneecap. The subjects were measured in an 
upright position by two technicians and their average value was used. 
Each investigator measured every skin-fold site at least three times 
before proceeding to the next area. The calipers used were a slight! 
modified version of the Minnesota type skin-caliper similar to th. 
described by Brozek (’52) and illustrated by Newman (752). The 
caliper points had a diameter of 6 mm with a surface pressure of approxi- 
mately 5 g per sq mm. 

The series used here were 1,702 White males and 292 American 
Negro males. This latter group will be referred to subsequently as 
Negroid to acknowledge the obvious racial admixture which was present 
in this sample. The average age of these men was 20.7 years with a 
relatively restricted age range, 17 to 28 years. The effect of physical 
selection on these men is unknown, but it probably deviates no more 
from a truly random sample of young American men than a series based 
on volunteer college students. 


NEGROID-WHITE COMPARISONS 


Any detailed comparision of White and Negroid skin-folds is com- 
plicated by the fact that the Negroids are superficially far leaner than 
the Whites. For example, the average Negroid body fat calculated from 
Brozek and Keys’ regression equation was 3.9%, approximately half the 
comparable white value of 6.8%. This preponderance of small skin-fold 
values in the Negroids results in a lack of individuals whose skin-fold 
values can be compared with the obese White subjects. This paper is 
not concerned with why the Negroids show lower skin-fold values but 
whether their subeutaneous fat is relatively rather than absolutely com- 
parable to Whites in terms of gross measured quantities. 

One method of investigating the relationships between skin-fold sites, 
irrespective of the amount present, is to calculate correlation coefficients 
for the 5 areas. The 10 possible coefficients are shown in table 1 for 
both racial groups. It is apparent that Whites and Negroids show a 
similar arrangement of coefficients in bodily areas; che chest, back, and 
abdomen skin-folds are highly intercorrelated with each other and less 
correlated with the extremities (arm and knee) which form another 


correlated set. The two matrices of coefficients are very close in magni- 
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tude, but the Negroid coefficients are slightly lower in value. There 
does not seem to be any readily apparent reason for this difference 
unless it is the effect of racial mixture appearing as heterogeneity in 
the coefficients. 

A statistical test of significant difference between the comparable 
White and Negroid correlation coefficients was applied to the 19 pairs 
given in table 1. Since the numbers of men involved for each site 
relationship were constant, a common significance level of ¢ of 1.96 at 
the 5% level and 2.58 at the 1% level were used. The results are shown 
below in table 2. It is difficult to evaluate such a table where two 


TABLE 1 


Correlation Coefficients between Skin-fold Sites of Measurement 


ARM BACK KNEE ABDOMEN 

WHITES 

Chest 79 .20 .73 .90 

Arm .78 83 81 

Back 74 87 

Knee 75 
NEGROIDS 

Chest 79 .8U .67 86 

Arm .74 82 80 

Back .68 86 

Knee 72 


t would 


relationships show significant difference and 8 do not, but 
appear that the skin-fold relationships between the bodily areas are at 
least roughly similar in these groups. A possible explanation for the 
difference in chest-abdomen coefficients may be brought out in the next 
section. 

Lacking any measure of body volume, it was not possible to construct 
regression equations between skin-fold values and calculated body fat 
for these two groups and to compare them. Furthermore, it was not 
possible to throw much light on the question of racial differences in body 
fat from a regression equation based on Whites alone. Because of this 
postulate, the samples were divided into increments of apparent obesity 
on the best evidence available, a summation of the measured skin-folds. 
The term “apparent obesity’ is used for two reasons: (1) It must 
be assumed that the two principal fat depots, superficial (subcutaneous) 


and deep (largely visceral), increase in roughly equal proportion with 


increasing obesity, This has been shown in Whites hy the close corre- 
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spondence between skin-folds and tissue density, but it remains an 
assumption for the Negroids. (2) It is apparent from Brozek and 
Keys’ (51) prediction equation between skin-fold thickness and body- 
fat from densiometry that the various sites do not have equal weight in 
this association, for example, abdomen, chest, and arm have relative 
values of 15, 39, and 46%, respectively. No differential weightings 
could be used with this Army data without prejudicing the results, and 
it is believed that no large error has been introduced by the simple 
addition of the skin-fold measurements into a summation value which 
must roughly align the men into increments of increasing obesity. 








TABLE 2 

Comparison of White and Negroid Values of r 

SITE “¢” VALUE SIGNIFICANCE 
Chest-arm 0.16 None 
Chest-back 5.47 1% 
Chest-knee 1.88 None 
Chest-abdomen 2.81 1% 
Arm-back 1.41 None 
Arm-knee 0.62 None 
Arm-abdomen 0.47 None 
Back-knee 1.72 None 
Back-abdomen 1.25 None 
Knee-abdomen 0.94 None 





The White and Negroid samples were divided into 5mm intervals 
based on the sum of abdominal, arm, and chest skin-folds. The mean 
value for each of the three skin-fold sites for all men in each summation 
interval was calculated and plotted in figure 1. The entire sample of 
1,702 Whites and 292 Negroids is not represented in this figure which 
shows only those individuals whose combined skin-fold value totaled 
45mm or less. Above this value there were not enough Negroids to 
avoid erratic sampling errors. The actual numbers of men represented 
by the plotted values in figure 1 are given in appendix A. 

The lines of mean skin-fold values in this figure closely approximate 
regression lines and may be tested statistically for significant differences 
in slope. This was accomplished for each racial pair and the lines were 
found to be not significantly different. It is apparent, however, that 
the Negroid has low skin-fold values in chest and arm and accompanying 
high abdominal skin-fold thickness as contrasted with the Whites. These 
differences were visible at the time of measurement. The Negroids were 
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characterized by well-developed and well-defined musculature in the torso 
and upper extremities, and musculature could not be well-defined when 
covered by a thick layer of subcutaneous fat. Hooton (’48) mentioned 
this when comparing body build photographs in 3,000 Negroids and 
30,000 Whites. He said that “the shoulders of the Negro customarily 
show relatively greater muscular development than is found in Whites. 
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Fig. 1. COMPARISON OF THREE WHITE AND THREE NEGROID SKIN-FOLDS. 


The Trapezii and Deltoids are characteristically well developed, as are 
also the pectoral muscles, the abdominal muscles, and those of the back. 
Thigh muscles are also generally good.’ 

The minimal] values for men who have 10 to 15 mm total for the 
three skin-fold sites are remarkably consistent between the racial groups. 
Two points may be inferred from this: (1) the average value must be 
very close to fat-free, doubled skin-fold thickness; (2) it suggests that 
there is no measurable difference in fat-free skin-fold thickness between 
Negroids and Whites, at least at these particular sites. Tanner (751) 


records three millimeters as the minimum skin-fold thickness found in 


approximately 400 British students. No skin-fold reading of a lower 
magnitude was recorded for the 2.000 men reported here. 
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It is possible to estimate the amount of body fat by substituting the 
mean values given in figure 1 for any given interval in the BroZek and 
Keys’ (751) prediction equation. This assumes that the overall sub- 
cutaneous fat to average body density relationship is basically similar 
for the two groups, and further that the Negroid superficial versus deep 
fat ratio is the same as that found in the White subjects. A calculation 
of percent body fat using the mean skin-fold values, rounded to whole 
millimeters, for the largest interval used in figure 1 (40 to 45 mm) 
gave a White value of 8.4% and a Negroid value of 7.9%. One half of 
one per cent is a very small difference for a measurement as full of 
potential pitfalls as the estimation of body fat. 

The first question asked in this study, whether skin-fold measure- 
ments on American Negroes can be converted to estimates of per cent 
body-fat from relationships established on White subjects, cannot be 
answered definitively from the data presented here, but it appears that 
ne great error would result from euch a practice. The relationships 
shown in table 1 and figure 1 are considered sufficiently similar to 
warrant using the total series of 1,994 men in examination of the next 
question, whether data on a large group of young men can show any 
new relationships between skin-folds and obesity. 


MEAN VALUES FOR THE COMBINED SERIES 


The use of the combined series, which now might be categorized as 
young American male, offers a much larger range of obesity than could 
be shown in figure 1 because of the inclusion of the fatter White subjects. 
The average values at all 5 measured skin-points have been plotted for 
all the summation of total skin-fold thickness intervals in which 10 or 
more individuals were represented in figure 2. The largest summation 
interval used in this figure (130 to 139mm.) represents approximately 
20% body fat. The actual number of individuals used in the calculation 
of the mean skin-fold values for each summation interval in figure 2 
are given in appendix A. 

There are several relationships to be seen in this figure. The most 
striking is the curvilinear relationship of the chest skin-fold to increasing 
total thickness. If this trend continues into the range of even greater 
obesity, it becomes second only to the abdominal site in importance. 
Both linear and second degree curve regression equations were calculated 
on the chest mean values and the standard error of the curvilinear 


equation was significantly smaller than that of the linear equation. This 
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curvilinearity may be caused by some peculiarity of this series, and, in 
any case, is not of much practical importance. 

Even more interesting is the difference in the slope of the lines across 
the range of obesity represented here. If a line perpendicular to the 
abscissa were drawn about midway on the summation scale, in figure 2, 
it could be considered to divide the series into those leaner than average 
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Fig. 2. CHANGES IN MEAN SKIN-FOLD VALUES WITH INCREASING TOTAL THICK 
NESS FOR TOTAL SERIES (2000 MALEs). 


on the left and those fatter than average on the right. The fatter group 
on the right half of the figure shows considerably different slopes than 
characterize the leaner group. The marked inclination of the abdominal 
line is readily apparent; this trend is present on the left half but is not 
as distinctive in terms of millimeters of skin-fold thickness. It is also 
apparent that the chest slope is roughly parallel to that of abdomen 
and both are at odds with the slope of arm and knee, while the slope of 
back skin-fold appears to have an intermediate position. This probably 
should be interpreted as illustrative of a differential development of 


subcutaneous fat with increasing obesity. The torso, especially on the 
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anterior aspect, has a larger capacity to store subcutaneous tissue and 
becomes increasingly important in fatter individuals, while the extremi- 
ties are limited in the physical space available for this tissue and, 
therefore, grow less important. If this situation also occurs when a 
large sample of men are arranged in ascending order of body density, 
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Fic. 3. PERCENTAGE RELATIONSHIPS OF THREE SKIN-FoLD THICKNESSES WITH 
INCREASING OBESITY (TOTAL SERIES). 


the use of any one multiple regression equation for calculating percent 
body fat cannot be entirely satisfactory. It probably makes little differ- 
ence in men of average fat values, but it must,decrease the accuracy of 
prediction on the very lean and the very fat. 

One additional graphic method of representing the relationships of 
the various skin-fold sites with increasing obesity has been used in figure 
3. In this graph the mean skin-fold values at abdomen, arm, and chest 
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have been expressed as percentages of the total thickness on the ordinate 
axis. ‘These percentages have been plotted against calculated percent 
body fat. Since the body fat data were derived from a multiple regres- 
sion equation of these three skin-folds, the values of fat and skin-folds 
cannot be independent of each other. 

A plot of percentage values such as shown in figure 3 maximizes any 
change in proportions at the three sites because as one site value rises 
some other must decline to preserve a totality of 100% for each interval. 
Only the calculated lines of best fit have been used in this figure to 
achieve a simplified graphic effect. The data from which the lines were 
calculated are given in appendix A. The lines of best fit for arm and 
chest are obviously curvilinear in this figure. Linear and second degree 
curve regression equations were calculated from the mean percentage 
values for each interval and the standard error of the arm and chest 
second degree equations were significantly smaller than the standard 
errors of the linear equations. 

The arm and chest lines in this figure form a reciprocal pair, similar 
to the crossover shown in figure 2. This presumably should be inter- 
preted in the same manner, namely, that with increasing obesity sub 
cutaneous fat is deposited in relatively greater quantities in the torso 


than in the extremities. 


SUMMARY AND CONCLUSIONS 


Two problems involving skin-fold measurements on a large series of 
young American men have been examined in this paper. The results 
cannot be considered definitive because no independent measure of 
average body density was available; the conclusions arrived at using the 
sum of the skin-folds can only be termed indicative. 

An analysis of Negro and White skin-fold relationships by comparing 
correlation coefficients between skin-fold sites and average skin-fold 
values for increments of obesity showed small and not statistically 
significant differences between these racial groups. It was concluded 
that the presen data do not show any reason why body fat on Negroes 


cannot be calculated from the published prediction equation based on 
White subjects. However, the comparison was limited by the scarcity 
of obese Negroes, and it is possible that the observed Negro deficiency 
of fat over the pectoral and triceps regions might become practicaliy 


4 + 


significant if very obese Negroes were compared to their White counter- 
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parts. This racial difference in the regional distribution of subcutaneous 
fat is an intriguing problem which should repay further investigation. 

A graphic comparison of the various skin-fold sites in the combined 
series of 2,000 men for presumed increments of increasing obesity indi- 
cated a change in the proportional representation of the average site 
values between the lean and fat subjects. The anterior torso areas, chest 
and abdomen, showed a rise in relative importance with increasing 
obesity at the expense of the extremity areas, t.e., the arm and knee. 
This was interpreted to be at least partially the result of physical space 
limitations in the extremities. It may form a source of error in 
predicting fat on all men of a comparable age group with a single 
prediction equation. 

This paper has only touched on a few of the interesting possibilities 
inherent in the skin-fold measurements of these 2,000 young men. Its 
major purpose was to validate, if possible, the calculation of body fat 
on the 361 Negroids for use in other analyses. It is believed that this 
has been accomplished, and a new and intriguing field of anthropometric 
racial investigation can be developed through the skin-fold measurement 
technique. 
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AGE DIFFERENCES IN THORACIC FORM AS 
INDICATED BY THORACIC INDEX 


BY EIJI TAKAHASHI AND HIROSHI ATSUMI 
Department of Hygiene and Public Health, Hirosaki University, Japan 


ARTIN (’28) and Mollison (’33) explain the flatter build of the 
M. human thorax, compared with that of other primates and mam- 
mals, by the change to upright position. This phylogenetic tendency 
to a flatter thorax is also seen in ontogenesis. The human thorax 
becomes flatter during the fetal and postnatal growth periods (Martin, 
°28). We have followed the transformation of the form of the thorax 
in Japanese in all the age periods of postnatal life. 

The present study is based on body measurements of 9369 inhabitants 
of the Tohoku area, especially Tsugara district of Aomori Prefecture in 
Japan. These individuals, from birth to 85 years of age, were measured 
for stature, weight, chest depth, breadth and girth, and some other 
measurements. Chest dimensions were taken at the end of expiration of 
tidal air. The percentage of chest depth to chest breadth is taken as 
the thoracic index, although Martin used the inverse. 

The subjects were mostly rural inhabitants. Children from 6 to 11 
years old were primary school pupils and those from 12 to 14 attended 
middle school; these are periods of compulsory education in Japan. 
The subjects from 15 to 17 were high school pupils and almost all those 
from 18 to 20 were also students. The adult group consisted of farmers, 
teachers, officials, and service workers, and the people over 60 years of 
age had had various careers and were mostly inmates of an institution 
for the aged. 

FINDINGS 

The findings are shown in table 1 and in figure 1. The average 
thoracic index of the newborn is 84 in both sexes. After birth the index 
decreases rapidly until in 4-year old children it is about 72 in boys and 
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TABLE 


Stature and Thoracic Index by Sex and Age 


(means and standard errors) 


Standing 
Age Number Height (cm) 

0-6 mos 74 63.4+0.49 
6-12 88 69.0+0.35 
1-2 yrs 142 76.30.38 
2-3 82 85.2+0.51 
3-4 83 91.2+0.51 
4 33 =100.2+0.92 
5 36 104.5+0.73 
6 115 109.1+0.36 
7 175) —-114.1+0.32 
8 181 119.9+0.37 
9 181 125.1+0.37 
10 212 129.4+0.34 
11 185 133.4+0.36 
12 168 138.5+0.49 
13 321 144.6+0.42 
l4 258 151.4+0.46 
15 506 158.2+0.31 
16 434 162.0+0.26 
17 377 163.2+0.28 
18 322. 164.4+0.28 
19 289 165.5+0.3 
20 191 165.1+0.38 
15—19* 91 161.3+0.68 
20-24 170 164.9+0.39 
25-29 27 164.7+0.53 
30-39 179 162.4+0.48 
40-49 158 162.3+0.54 
50—59 14] 159.6+0.7]1 
60-69 18 (154.0+1.60) 
70 & over 67) 


26 (152.51. 


N. B. 


MALE 


Thoracic 


Index 





83.88+1.37 
80.28+0.76 
78.24+0.44 
76.72+0.57 
74.02+0.45 
72.24+0.79 


71.88+0.66 
72.14+0.37 
72.16+0.33 
71.46+0.16 


71.52+0.33 


71.12+0.30 
71.76+0.34 
71.90+0.33 
71.64+0.25 


72.16+0.26 


70.13+0.24 
69.20+0.25 
68.98+0.23 
70.08+0.30 
68.92+0.30 
68.80+0.43 


69.20+0.50 
68.52+0.44 
67.92+0.59 
69.48+0.39 
70.24+09.47 
72.18+0.53 
76.67 +2.06 
80.38+1.68 


71 
86 


FEMALE 


Standing 


Number Height (cm) 


62.1+0.47 
68.2+0.44 
74.3+0.33 
82.1+0.53 
90.7+0.43 


98.5+0.98 


103.5+0.76 
108.8+0.36 
113.6+0.37 
118.3+0.35 


123.9+0.37 


128.6+0.43 
133.8+0.47 
141.6+0.55 
145.1+0.39 
148.8+0.32 


151.2+0.25 
152.9+0.27 
153.5+0.33 
154.3+0.39 
154.0+0.51 
153.2+0.64 


145.9+0.91 


(144.4+1.57) 


(140.2+1.61) 


Subjects from 6 to 11 years are primary school children, 





Thoracic 
Index 


83.72+0.94 
79.80+0.68 
77.96+0.60 
75.40+0.50 
72.61+0.66 
71.00+0.84 


70.70+0.82 
70.68+0.39 
70.38+0.41 
70.05+0.32 


70.34+0.36 


69.76+0.35 
69.18+0.32 
68.98+0.37 
71.06+0.28 
71.48+0.35 


71.21+0.27 
71.28+0.34 
.62+0.40 
.24+0.36 
.02+0.56 


.16+0.94 


~ 


m MO to 


.64+0.48 
.00+0.60 
760.60 
.98+0.50 
62+0.61 
.02+0.71 
82.05+2.43 
83.33+1.78 


“It bo bo bo 


from 12 to 


14 pupils of middle school, from 15 to 17 pupils of high school and from 18 to 20 


also mostly students. 


* 15-19 year old youths who do not attend school. 
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71 in girls. It seems that the decrease is greater in girls. The average 
thoracic index continues to decline in girls 4 to 12 years of age when 
it is about 69, significantly lower than in boys of the same age. In 
other studies of Japanese (figure 2) and in American Negroes (figure 3) 
the thorax is also flatter in females of these ages. 
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Fic. 1. THoracic INDEX CURVE OF JAPANESE IN TSUGARA DISTRICT. 


The thoracic index of girls age 13 appears to be suddenly larger (to 
about 71) while the index of the males goes down from 15 to 25-29 
years of age (to about 68) so that the male chest is flatter from between 
14 and 15 years of age onward. From about the age of 25 to 29 chest 
index begins te be larger again, increasingly so after the 40-49 year 
old period and this is apparent in both sexes. In the age class of 70 
and over (really 70-85 years of age) the index is again over 80 in both 
sexes as in the newborn. 


DISCUSSION 


The chest depth and breadth of Japanese have also been measured 
by some other authors (figure 2), but not for all age classes. Differences 
in the thoracic index reported by different authors seems to be partly 
explained by differences in the province, occupation or rural-urban 
nature of the population studied; it may also represent differences in 
technique involving different phases of respiration or a different level 
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of the chest in the measurements of different authors. In any case, 
all authors report the same tendencies as to sex and age differences. The 
other studies of Japanese accord with ours in age groups up to 11 or 12 
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Fic. 2. THoracic INDEX CURVE OF JAPANESE SCHOOL CHILDREN BY OTHER 

AUTHORS. 

Solid line: male; dotted line: female 

(a) (b) (Urban) MrNnayosnt, |. 1928, Nippon Gakko Eisei (in 
Japanese), 10: 721-742. 

(c) (d) (Rural) Mrnayosui, T. [hbid. 

(e) (f) Suma, S. 1931, Nippon Gakko Eisei (in Japanese), 19: 
199-204. 

(g) (h) Désono, 8. 1946, Essays of Anthropology, Human Genetics 
and Constitution (in Japanese), Keid Gizyuku Univ., 6: 
1-115. 

(i) (j) InaGak1, T. AND A. NAKAZAWA 1947, Ibid., 11: 55-165. 


years of age, but Martin’s (’28) data on Negroes is most similar to 
ours in all age classes. All groups seem to show that the rounder chest 
of the newborn becomes increasingly flatter until about 4 years of age, 
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that before puberty girls have relatively flatter chests than boys and 
that after puberty girls’ chests become markedly rounder. After 14 
years of age the period of compulsory education ends and few children 
of the less privileged classes attend school; there is, therefore, a selective 
factor in our series. Nevertheless, there seems to be a real tendency to 


further flattening in the chest of males until 25-29 years. The increased 


flattening of the chest in successively older persons up to this age seems 
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Fie. 3. THoracic INDEX CURVE RY SEX AND AGE By OTHER AUTHORS. 
Solid line: male; dotted line: female. 


(k) (1) Japanese prisoners, Dont, N. 1949, Essays of Anthro- 
pology, Human Genetics and Constitution (in Japanese), 
Keid Gizyuku Univ., 14: 15-206. 

(m) (n) American Negroes, MARTIN, R. 1928, Lehrbuch der 


Anthropologie, Gustav Fischer, Jena. 
to be a developmental phenomenon. Although chest depth also increases, 
the increase in chest breadth is more rapid. Martin (’28) says that 
chest breadth always increases at the cost of chest depth. Chest breadth 
shows a significant positive correlation with stature, but chest depth 
does not (table 2). The greater relative depth of the female chest after 
puberty seems to represent a real sex difference; it is, of course, at this 
time that there is a rapid development of the female breasts. 

Many authors agree with us in noting a cessation of growth in chest 
breadth after ¢ 
depth. 


30-39 years of age but a continued slow increase in chest 
The thorax thus becomes rounder, but this may be a regressive 
rather than a developmental phenomenon. 
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In order to analyze the thoracic index further, we calculated sepa- 
rately that of adults of different occupation groups (table 3). In both 
males and females the lowest index is found in teachers and progres- 
sively higher indices are found in officials, service workers and farmers 
respectively. This same tendency is seen in the different age periods 
considered separately, and, conversely, the tendency to an increase in 


the thoracic index with age occurs in each occupational group. 


TABLE 2 


Coefficients of Correlation between Diameters of Chest, Thoracic Index, Relative 
Transverse Area of Chest, and Other Physical Measurements of Adults 


RELATIVE 
TRANSVERSE 


CHEST CHEST rHORACIC AREA 
DEPTH BREADTH INDEX OF CHEST 
Male Female Male Female Male Female Male Female 
Standing height 0.040 0.017 0.245** 0.130* 0.184** 0.121* 0.106** 0.165** 
Sitting height 0.022 0.025 0.250** 0.158** —0.193** —0.122* 0.104** 0.137** 
Girth of chest 0.583** 0.622** 0.628** 6.600** 0.087* 0.110% 0.695** 0.679** 
Body weight 0.544** 0.563** 0.616** 0.574** 0.009 0.198** 0.628** 0.646** 
Pelidisi 0.494** 0.504** 0.390** 0.421** 0.193** 0.190** 0.567** 0.809** 
Vital capacity 0.130** 0.167** 0.372** 0.327** —0.037 0.026 0.311** 0.276** 
I b 
Strength of 
back muscles 0.110* 0.149* 0.230** 0.247** 0.036 0.003 0.259** 0.419** 


N. B. Number of 


measured adults: 


* Over the level of significance of P 0.05 


** Over the level of significance of P 0.01. 


656 males and 350 females 








The relative thickening of the chest in older age may be caused 
by the increased spinal curvature or by the decrease in height of the 
thorax which follows the loss of fluid and compression of the disct 
intervertebrales. 

The thorax is composed of the spine (vertebrae), ribs and sternum. 
The spine is the most stable of these three elements and the motility 
of the ribs increases from the vertebral to the sternal end. Chest breadth 


is less affected by movement than is chest depth. The difference be- 


tween measurements on inspiration and on expiration is less for chest 
breadth than for chest depth (table 4). Furthermore, chest depth tends 
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TABLE 3 
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Chest Depth, Breadth and Thoracic Index by Sex and Occupational Group 


(means and standard errors) 





Male 


Chest depth 


(em) 


Chest breadth 


(cm) 


Thoracic ind 


Female 


Chest dept h 


(cm ) 


20-29 
30-49 
50-59 
Average 
20-29 
30-49 
50-59 
Average 
20-29 
ex 30-49 
50-59 


Average 


20-29 
30—49 
50-59 
Average 


20-29 


Chest breadth 30 49 


(cm) 


50-59 


Average 


Thoracic index 30-49 


Average 


TEACHERS 


OFFICIALS 


SERVICE 
WORKERS 


FARMERS 





18.03+0.11 
18.64+0.12 
18.99+0.25 
18.55+0.06 
26.63+0.12 
26.99+0.13 
27.04+0.26 
26.89+0.04 
68.08+0.49 
69.20+0.52 
70.06+1.12 


69.11+0.28 


16.91+0.14 
17.00+0.18 


16.96+0.08 


23.50+0.08 
71.46+0.52 


72.88+0.76 


12+0.39 


18.15+0.15 
18.63+0.16 
19.32+0.24 
18.70+0.07 
26.45+0.15 
26.88+0.16 
26.94+0.24 
26.76+0.05 
69.84+0.05 
69.64+0.66 
72.00+1.11 
70.49+0.34 


16.53+0.20 
17.50+0.18 


17.02+0.10 


23.41+0.21 
23.19+0.24 


23.30+0.10 


74.24+0.70 


72.33+0.49 


* Data for the age class 50-59 are excluded. 


18.44+0.23 
19.10+0.30 
19.30+0.25 


18.95+0.07 


69.88+0.91 
70.92+0.94 
72.46+1.15 


71.09+0.37 


17.80+0.17 
18.21+0.17 


(18.47+0.26) 
18.02+0.09* 


24.44+0.12 
24.48+0.15 


(24.12+0.27) 


24.46+0.07* 


72.76+0.73 


74.50+0.61 


(76.58+1.11) 


73.57 +0.38* 


18.98+0.21 
19.15+0.21 
19.40+0.28 
19.16+0.08 


27.11+0.21 
27.06+0.20 
26.40+0.34 
26.86+0.07 


70.14+1.04 
71.02+0.97 
73.54+1.16 
71.64+0.38 


17.94+0.25 


18.52+0.21 


(18.92+0.25) 


18.23+0.09* 


24.56+0.28 
24.34+0.31 

24.30+0.30) 
24.45+0.08* 


73.16+1.05 
76.10+0.92 


(77.91+1.05) 


74.60+0.40* 














72 EIJI TAKAHASHI AND HIROSHI ATSUMI 


to be more variable; for example, the difference in mean chest breadth 
between male teachers and farmers (as shown in table 3) is slight and 
not significant but the difference in chest depth is statistically significant. 

As sitting height decreases with age and thoracic contents do not 
decrease correspondingly, there must be an adjustment in the breadth 


TABLE 4 


Comparison of the Difference between Chest Depth and Breadth 


in Deep Respiration 


IN DEEP IN DEEP 
INSPIRATION EXPIRATION DIFFERENCE 
AGE NUMBER (cm) (em (em) I—E x 100 
CLASSES MEASURED (1) (E) (I—E) 4(1+ EF) 


Chest Dept h 


20-29 315 19.46 17.41 2.05 11.12 
30-39 130 19.64 17.50 2.14 11.52 
40—49 59 20.14 17.92 2.22 11.66 
50-99 31 19.77 17.94 1.83 9.70 
Average 535 19.59 17.53 2.06 10.92 


Chest Breadth 


20-29 315 26.49 24.91 1.58 6.14 
30-39 130 26.67 94.95 1.72 6.66 
40-49 59 26.56 25.03 1.53 5.93 
50-59 31 25.71 24.51 1.20 4.77 

1.56 6.07 


Average 535 26.47 24.9] 


N.B. Calculated from the data of DOHI on male prisoners (Essays of 
Anthropology, Human Genetics and Constitution, edited by Dr. Taniguchi, Vol. 14, 


1949, Keid Gizyuku University, Japan. In Japanese) 


and depth of the thorax. The ribs themselves and the thoracic girth 
increase little, if at all, and the increase is in depth rather than in 
breadth. The increased area of the thoracic cross section is achieved 
by the change from an ellipse towards a circle. 

In Japan there are many people with marked spinal curvature which 
is not caused by any disease, but can be ascribed to the way of life and 
work—for instance, rice farming by the Japanese method, sewing and 
sitting on floor mats in the Japanese manner. Table 5 shows the 


correlation between senile curvature of the spine and roundness of the 


thorax. The spinal curve in the sagittal plane from the seventh cervical 





h 


h 
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to the fourth lumbar vertebra was copied with a lead and plastic foe 
drawing curve ruler. The index of spinal curvature is expressed as the 
ratio of the length of the perpendicular (height of the curve) to the 
length of the cord. We calculated the coefficients of correlation between 
age, stature, thoracic index and index of spinal curvature on the basis 
of investigations of 58 senile persons housed in an asylum. The cor- 
relations of all of these factors are statistically significant except for 
age with thoracic index. The correlation between thoracic index and 
spinal curvature is especially marked and is demonstrable in the second 


TABLE 5 


Correlation Coefficients among Age (1), Stature (2), Index of Spinal 
Curvature (3) and Thoracic Index (4) in Old People 








fxn = — 304 Tr, = +.398 rr, =+ 204 
lin3 = —-152 Tis3 = + .306 Tun = + .145 
Tig = — 235 Tine = + .326 lus = +.391 
Tio. = —-114 Tis.06 = + 273 Tus.ss = + .283 
Tse 453 Tf. =—.380 Ty = + .554 
fan = .380 Tu.. = — 331 Yo, == -+- 513 
fa. = 314 Tus =—.174 Tua = + 463 


Tes.16 = — -258 Toe.18 = — -136 T.19 = + .498 





order ; that is, it is siginficant even when the influences of age and stature 
are excluded. In senile persons with markedly crooked spines the chest 
depth is sometimes greater than the chest breadth as in most quadrupedal 
mammalia. These extremely bent individuals walk with the aid of a 
stick holding the upper part of the body parallel to the ground in much 
the same manner as a quadruped. If the influence of the gravity of the 
thoracic contents is deduced to explain the form of the chest in quadru- 
peds, it should be considered in the case of such stooped senile indi- 
viduals also. 
SUMMARY 


Studies of the thoracic form in 9,369 rural inhabitants of the Tohoku 
area in Japan indicate certain developmental and retrogressive changes. 
The thorax of the newborn is rounder in transverse section but flattens 
increasingly until 4 years of age. In childhood, girls’ chests are flatter 
than those of boys, but at the time of puberty, girls’ chests suddenly 
become rounder and this is followed by a continuing tendency to greater 
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roundness with age. The chests of boys continually become flatter until 
the 25-29 year old period. These changes seem to be developmental. 
Despite differences in the thoracic index between occupational groups, 
all such adult groups show a gradual rounding of the thorax after the 
age of 30 and the change accelerates with age. Two reasons have been 
considered as a cause of the rounding of the thorax with age: decrease 
in stature, especially in sitting height, and senile curvature of the spine. 
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RELATIVE FAT PATTERNING: AN INDIVIDUAL 
CHARACTERISTIC * 


BY STANLEY MARION GARN 


Fels Research Institute, Antioch College 
Yellow Springs, Ohio 


INTRODUCTION 


NTHROPOLOGISTS, clinicians and biometricians familiar with the 
A appearance of nude subjects or experienced in viewing body-build 
photographs have long been impressed by marked individual differences 
in subcutaneous fat patterning. Such differences, equally apparent when 
soft-tissue radiographs are reviewed, do not always escape notice even 
when subjects are fully dressed. A number of seemingly different fat 
“ patterns ” may be intuitively identified in even a small series of persons 
of like sex, constant age, and from a single racial group. 

The medical and anthropological literatures are well stocked with 
descriptive terms for different subcutaneous fat patterns. Without at- 
tempting to list all of them, and excluding pathological conditions 
like adiposis dolorosa, and such unusual depositions of fat as characterize 
steatopygia, certain patterns have been listed repeatedly. They include 
an “upper extremity pattern,” a “girdle fat pattern,” and a “ lower 
extremity pattern.” Variants in thoraco-abdominal fattening have been 
described as “D” shaped and “d” shaped. Further we read of a 
“juvenile fat pattern,” an “adult fat pattern” and a “ Rubens fat 
pattern ” (Martin ’28, Sheldon et al., 40, Angel ’49, Skerlj et al., 53). 


* Section 3 of this paper is based on data obtained in the course of investi- 
gations carried out in collaboration with Dr. Josef BroZek at the Laboratory of 
Physiological Hygiene, University of Minnesota, and supported in part by funds 
provided under a contract between the regents of the University of Minnesota 
and the Department of the Army (contract No. DA 44-109-gm-1526). Permission 
to use the data was obtained from the Quartermaster Food and Container 
Institute for the Armed Forces and the Medical Nutrition Laboratory, United 
States Army. The views or conclusions contained in this report are those of 
the author. They are not to be construed as necessarily reflecting the views or 
endorsement of the Department of Defense. 
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While such designations are often devastatingly good, they lack 
quantification. depend much upon the judgment and preferred categories 
of the viewer, and may reflect in part configurations of the structures 
underlying the tela adiposa (cf. Skerlj et al., 53, p. 593, Reynolds 751, 
p. 152). More important they may represent simple nutritional variants. 
As we have previously shown, the rate of fat deposition differs on 
different parts of the body so that (in absolute terms) the same indi- 
vidual may exhibit a different fat pattern when thin than he does when 
fat (Garn *54). Similarly, older and younger men may differ in the 
pattern formed by the absolute fat thicknesses not because of an age 
factor, but simply because the older man is fatter (Garn and Harper ’55). 

If, however, fat patterning is dealt with not in absolute terms, that 
is, the measured thicknesses themselves, but in terms of their z score 
transforms, pattern differences due solely to the amount of fat are 
eliminated. In this way it is possible to investigate interpersonal differ- 
ences in fat patterning duly compensated for differences in the nutritive 
state. This technique of using z score profiles, which has had extensive 
use in psychometrics under the name of “ profile analysis,” seems highly 
applicable to the study of what we shall call relative fat patterning. 

It is the purpose of this study to investigate the extent of inter- 


“ 


individual differences in “ relative fat patterning.” We shall indicate 
the degree to which individual males depart from the pattern expected 
for their degree of fatness. We shall demonstrate the existence of 
specific relative fat patterns in a series of adult males. And finally, 
we shall present evidence that individual relative fat patterns persist 
during weight loss, thus indicating that the relative fat pattern is an 
individual characteristic with some degree of permanence. 


METHODS AND MATERIALS 


The present study of relative fat patterning in the adult male 
is based on micrometer-caliper measurements of subcutaneous fat 


“shadows” as seen in soft-tissue X-rays of 94 adult males. The 


2The z score is (X Y)/o. In the present study the term “absolute fat 
pattern ” as it applies to individuals refers to the pattern formed by the absolute 
or measured fat thicknesses, while the term “relative fat pattern” is used to 
designate the individual’s z score profile. The absolute fat pattern may also be 
thought of as the “overt fat pattern,” since it is this aspect of patterning that 
is perceived by the trained observer, while the relative fat pattern may be con- 
sidered to be the “covert fat pattern,” since it is ordinarily evident only after 


converting the absolute measurements into z scores. 
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roentgenographie techniques and the location of the fat thicknesses 
measured are as described by Garn and Saalberg (753), Garn (754) and 
Garn and Harper (755). 

Eighty-one of the subjects were adult male participants in the Fels 
Longitudinal Studies and were for the most part fathers of Fels 
children (Garn *54). Thirteen of the men were subjects in an investiga- 
tion of controlled weight loss, conducted at the Laboratory of Physiolog- 
ical Hygiene, University of Minnesota (BroZek et al., 55). These latter 
men were X-rayed by the author and Dr. Josef Brozek during a main- 
tenance period and again at the end of a period of restricted caloric 
intake, employing the same roentgenographie procedures used on the 
larger series. 

Each of the 9 measured fat thicknesses for each subject was converted 
into the corresponding z score or standard score value (see Guilford, 
°36) using the normative data published earlier (Garn ’54).? In this 
way the 9 fat measurements, which differ markedly in thickness and 
absolute variability, were placed upon a comparable basis. Z scores, it 
may be recalled, have a mean of 0 and a standard deviation of 1 
(Guilford *36, p. 86, Edwards, *47, p. 47); thus a hypothetical indi- 
vidual exactly 1 sigma above the mean in all 9 fat measurements would 
be represented by a z score value of + 1 throughout. The appearance 
of such z score profiles is depicted in figure 1. 

The extent of variability within an individual was measured by the 
statistic oz. This is simply the standard deviation of the individual’s 
own z scores for subcutaneous fat. For the hypothetical individual 
described above oz would be 0 since all of his z scores were identical. 
Individuals more variable in relative fat patterning might have oz values 
of 0.5, 1.0 or even 1.5. oz therefore measures the extent of individual 
variability in subcutaneous fat patterning, corrected for the amount of 
fat present. 

However, while oz provides a numerical measure of relative fat 
variability, it cannot of itself indicate where an individual departed from 
the general pattern. Accordingly, those individuals with above-median 

2 The use of 2 scores implies either that the distributions are normal or of the 
same form. This is not strictly true, in the present case, since x? tests of the 36 
combinations reveal a significant degree of heterogeneity of the distributions, 
with the distribution of hip fat somewhat different from 5 of the 8 others. 
However, this does not affect the general nature of the findings and especially the 
pattern comparisons. For a short discussion and further references see McNemar 
(’49, p. 38). 
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oz values were hand-sorted, and their z score values plotted on suitable 
coordinate paper, as shown in the lower part of figure 1. In this way 
it was possible to analyze, and in some cases categorize the diverse 
relative fat patterns making for high oz values. The “pattern index” 
technique developed by Lacey and Van Lehn (752) was followed in part. 
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Fie. 1. Comparison of absolute fat patterns (above) and corre 
sponding relative fat patterns (below). Left, hypothetical indi 
vidual exactly at the mean in all 9 fat thicknesses. His mean z 
and fat variability score (oz) are both 0. Right, hypothetical 
individual with relatively thick arm fat and relatively thin leg fat. 
Though the difference between the two men is evident in the 
absolute fat patterns, it is striking in the relative patterns and 
in the fat variability scores. 

The abbreviations: DI, MA, LA, I, TR, PL, AL, ML, and LL 
refer to deltoid insertion fat, medial arm fat, lateral arm fat, 
iliac fat, trochanteric fat, posterior leg fat, anterior leg fat, 


5 =] 


medial leg fat and lateral leg fat. 


Finally the permanence of such z score fat patterns, or more precisely 
their resistance to change during weight loss, was investigated in the 
Minnesota group. Here the z score profile patterns before and after 
weight loss for each individual were plotted and compared. The degree 
of similarity or fit between the “ before” and “after” patterns was 
expressed numerically as odz, the standard deviation of the z score 
differences. Like oz, odz also has a theoretical lower limit of 0, and 
no upper limit. A odz of 0 would be attained if an individual in losing 


fat lost the same amount (in z score units) in all 9 fat thicknesses 
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measured. Thus odz measures the degree of similarity between z score 
profile patterns, and may be called the profile similarity indez. 

These procedures which deal with z scores, rather than absolute fat 
thicknesses, are made necessary by the facts that the different fat 
thicknesses measured differ considerably in their mean values, and that 
the rate of fat deposition or withdrawal is non-uniform over different 
parts of the body. 


FINDINGS 
1. Relative fat variability scores (oz) 
To | 


subcutaneo 


egin with, the standard deviation of individual z scores for 
is fat (oz) was calculated for each of the 81 males in the 


basic series. As shown in table 1, the value of oz ranges from a low of 
0.26 to a high of 1.45. The lowest value observed approaches the 


TABLE 1 


Distribution of Relative Fat Variability Score (oz) for 81 Adult Men 


RANGI \O. PERCEN1 CUMULATIVE PERCENTAGE 
0.00-0.19 0 0 0 
0.20-0.39 1] 14 14 
0.40-0.59 37 46 60 
0.60-0.79 18 22 82 
0.80-0.99 10 12 94 
1.09-1.19 2 2 96 
1.20-1.39 l l 97 
1.40-1.59 2 2 99 
1.60-1.79 0 0 99 


The mean is 0.61 and the median is 0.56. 


theoretical minimum of 0.0, and indicates a high degree of z score 
uniformity, and an almost total absence of relative fat patterning. The 
high values, on the other hand, are indicative of considerable internal 
variability, disproportion or “ dysplasia ” in subcutaneous fat deposition. 
Since oz is a new statistic in this work, it is of interest to note that the 
mean was 0.61, and the distribution considerably skewed, with a Pear- 
sonian coefficient of skewness (Sk) of 0.75. While such skewness is to 
be expected where fat is concerned, it indicates that the median rather 
than the mean oz should be employed as a measure of central tendency. 
Accordingly, in selecting individuals with high oz values, for the second 
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part of the study, selection was based on above-median rather than 
above-average o2Z scores. 

Though, anthroposcopically, older men seem to be more variable in 
absolute fat patterning, there was no evidence that age alone influenced 
the relative fat variability scores. The mean oz for 40 younger men 
did not differ significantly from the mean for 41 older men. As a 
confirmation the product moment r between age and oz failed to attain 
significance at the 59% level. In like fashion, thin and fat men did not 

TABLE 2 


Fat Variability Scores (cz) for Older and Younger and Fatter and Thinner Men 


GROUP NO. MEAN AGE oz RANGE MEAN ¢ 3. D 
Total group gl 39.8 0.26-1.45 0.61 0.24 
Younger men 28 30.4 0.26-1.08 0.62 0.21 
Older men 43 48.1 0.32-1.45 0.61 ? 0.26 
Leaner men? 41 39.5 0.28-1.45 0.57 0.25 


Fatter men ? 37 40.8 0.32-1.43 0.66 8 0.25 


1 Trochanteric fat below 16.6 mm. 
7 Trochanteric fat over 16.6 mm. 
* Differences between means not significant at 5% level. 


differ significantly in mean oz (table 2). However, a low, but positive 
and significant correlation of 0.32 between trochanteric fat and oz 
suggested at least that fatter, or fatter-hipped, men may be slightly 
more variable in relative fat patterning. Under these circumstances, 
some degree of caution may be required, before saying that the very fat 
are more “ dysplastic ” than the very thin. 


2. Relative fat patterns 

Since a given value of oz merely expresses the degree of variability 
in the individual’s z scores for fat, without explaining the source of the 
variability, it was of interest to explore the various relative fat patterns 
in order to determine whether there were “upper extremity,” “lower 
extremity” and “girdle fat patterns,” etc. Therefore the z score values 
for subcutaneous fat were plotted for all men above the median az, 
and the z score profile patterns were examined further. 

Of the 40 men so plotted, it was not possible to describe or “ name ” 
the pattern anatomically in 23 cases. That is, these 23 men high in oz 


were by inspection high or low in individual fat thicknesses (medial arm, 
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posterior leg, etc.) or showed highly localized highs and lows. In 17 
men, however, the patterns involved anatomical regions, with deviations 
up to two z scores from their own mean z and ranges up to 4 z scores. 
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Fic. 2. Twelve representative z score profile patterns for fat. A, near-absence of relative 

fat patterning. B, lower segment pattern. C and D, leg fat patterns. E and F, arm and 

leg patterns. G, example of a pattern not easily categorized. H, iliac fat pattern. I, arm 

fat pattern. J, arm-trochanter fat pattern. K, low hip-fat pattern (— 2¢ below the indi- 

vidual’s mean). L, high hip pattern (1.90 above individual’s own mean). 


The abbreviation DI, MA, LA, etc. refer to specific fat thicknesses as in figure 1. 


Such cases where lower leg fat or arm fat responded as a whole were 
simpler to name. 

In 5 men there was a clearly-marked leg fat pattern characterized 
by elevated z score values for the entire leg region. Examples are shown 
in figure 2, ec and d, wherein it is evident that a leg fat pattern 
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(averaging two or more z scores above the individual’s average z) may 
be found both in men who are thin and those others who are fat. 
Three additional men showed a combined arm and leg pattern, giving 
their z score profiles a notably sinusoidal appearance. One man, out 
of the 40, evidenced a combined pelvis and leg pattern, a combination 
rarely to be expected on a chance basis alone. 

In all, 3 men exhibited the kind of concomitant trochanteric-iliae 
fat excess, that could be called a “girdle fat pattern.” However, it 
should be noted that the z scores correct for the usual tendency to deposit 
fat differentially in this region, so that these 3 men are exceptionally 
“hippy ” specimens. The individual pictured in figure 2, h, is average 
in his total fat thickness but markedly deviant in iliac fat. 

Other nameable patterns include an arm-hip pattern (figure 2, i), a 
true arm pattern (figure 2, j}), and a pattern involving low iliac fat 
(figure 2, k). The individuals shown in figure 2 k and | are quite 
exceptional in that their iliac fat is nearly 2 standard scores out of line 
from their own z score mean. 

Therefore it is evident that there are a number of relative fat 
patterns contributing to an above average oz, and characteristic of 
individuals. While some of these relative fat patterns can be described 
by terms previously used (lower-extremity, upper-extremity ete.) over 
half of them are not easily described in words. Under these circum- 
stances the z score profiles, or better, ¢ score profiles, may prove more 
useful in their original form than when further categorized. To put 
it differently, there are more, and more complicated, relative fat patterns 
than we can conveniently force into categories. 


? 


8. Permanence of the relative fat pattern 

While it is evident that there are marked interpersonal differences 
in relative fat patterning, one may well ask whether any given relative 
fat pattern possesses permanence, and persists despite changes in nutri- 
tional status. Fortunately data on the 13 young men from the weight 
loss series allow us to answer this question with some degree of assurance. 

As shown in figure 3, 1 through 13, the individual z score profile 
patterns after weight loss closely resemble the initial patterns and in 
several cases are practically identical (figure 3, 1, 2,3, and 12). In none 
of the 13 cases did foed restriction cause a reversal of pattern or even a 
marked change. Interestingly, the tendency for the individual relative 
fat patterns to hold, despite generalized loss of fat, is as marked in 


those men who were initally obese as among those who were lean even 


before food restriction. 
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Fig. 3. 1-13; Relative fat patterns of 13 young men before 


and after a period of food 


restriction. Despite generalized fat loss, as shown by the lower level of each “ after ” pattern, 


individual pattern profiles maintained their identity. 


A and B; hypothetical patterns illustrating (A) _ perfect pattern congruence and (B) 


pattern dissimilarity with several reversals. 
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However, while the “before ” and “after” z score profiles provide 
striking visual proof of the resistance of the relative fat pattern to 
changed nutritional status, the plotted patterns do not afford a quantita- 
tive measure of pattern similarity. Therefore, quantification was accom- 
plished by computing odz which is the standard deviation of the 
differences expressed in z score units. As pointed out previously, and as 
illustrated in figure 3, A and B, if two z score profile patterns are 
identical, the value of odz will be 0. odz, therefore, measures the 
degree of profile similarity. 

As shown in table 3, the observed values of odz (the profile similarity 
index) range from 0.20 to 0.50 for the 13 men in the weight reduction 
series, with a mean of 0.39 -+ .03 for the group as a whole. In contrast 
the values of odz for 13 randomly selected pairs of men were much 
higher, ranging from 0.50 to 1.36 and the mean edz for these randomly 
selected pairs was 0.77 + .07. Thus an individual continues to resemble 
himself in the relative fat pattern, despite considerable and rapid weight 
loss. 

Since all of the men in the weight-loss series had lost fat during the 
course of the experiment, there were possible objections to the use of 
random pairs for comparison, even though the actual pairings were made 
by a disinterested member of the Fels Institute staff. Accordingly, a 
second set of pairs was set up, matched on a weight basis to the starvation 
series men before and after food restriction. Here, the values of odz 
ranged from 0.41 to 1.47, with a mean of 0.67 + .07. These results were 
substantially similar to those mentioned above. 

These particular findings may be summarized by saying that indi- 
vidual relative fat patterns hold their shape during weight reduction 
even though the measured fat thicknesses (1. e., the absolute fat pat- 
terns) change. The use of odz makes it possible to demonstrate that 
the relative fat patterns of an individual under two different conditions 
of nutriture are more alike than the relative fat patterns of two different 
individuals, whether selected on a weight basis or not. 


DISCUSSION 
In this communication we have returned to a consideration of the 
individual, the prime unit of constitutional research, though limiting 
ourselves to the study of individual differences in one tissue layer (the 


tela adiposa), one sex and age group (adult males), and to purely 


objective measurements. We have shown that it is possible to go beyond 
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the absolute fat patterns, which are highly responsive to changes in 
caloric intake, to the relative fat patterns, which appear to be of a 
more permanent nature. 

The measure oz, the standard deviation of the individual’s own z 
scores for fat, provides a direct measurement of relative fat variability. 
oz, which has a lower limit of 0, and no upper limit, measures the 
extent to which the individual departs from a straightline z score 
profile. Since z score units are used, and variability is measured about 
the individual’s own mean z, age does not enter into the picture, and for 
the most part differences in the degree of obesity may be ignored. A 
male with a oz greater than 0.61, or better 0.90, may be considered 
dysplastic (in regard to distribution of fat) whether fat or lean. 

While a high oz indicates that an individual does have an unusual 
fat pattern, it cannot of itself indicate why or where. Here, actually 
plotting the z score profiles for fat becomes a necessity. As shown by 
the analysis of individual z score profiles, there are a number of distinct 
and individually characteristic relative fat profiles. Some of them can 
be categorized as “‘ arm-fat,” “leg-fat,” “arm and leg fat,” ete. On 
the other hand, there are a larger number, quite distinct in the profiles, 
that cannot be categorized, or categorized in simple language. And, 
since our interests extend well beyond the level of simple description 
there is no need to do so. 

The probability that a given relative fat pattern is an individual 
characteristic, possessing some degree of permanence through time, is 
heightened by the findings in the weight reduction group. Here all of 
the men lost weight (up to 14 kg) and all lost fat in all 9 fat layers 
measured. Consequently the z score patterns were all displaced down- 
ward at the end of the weight loss period. Yet the individual patterns 
tended to “hold.” The extent of similarity between the individual 
“before” and “after” patterns was striking, and there was little 
difficulty in recognizing each individual from his post-starvation z 
score pattern. However, the permanence of such patterns has not yet 
been demonstrated over longer time periods, or under different nutri- 
tional regimens. 

Still, it is important to point out the utility of the relative fat 
pattern concept. As is well known, absolute fat patterns undergo 
qualitative changes dur 
changes thereafter. Under these circumstances, permanence of type 


ine adolescence and less striking quantitative 


cannot be sought either in the absolute fat thicknesses, or in the patterns 
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formed by the absolute fat thicknesses. Relative fat patterning seems to 
afford the only approach both to the problem of permanence of type 
and the problem of genetic determinants, as far as the tela adiposa is 
concerned. 

Previous workers have demonstrated a general tendency for children 
to hold position, either on a rank basis or a percentile basis, in the 
thickness of specific fat layers during the growing period (Stuart e¢ al., 
"40, Stuart and Sobel °46, Reynolds 51). While such constancy sug- 
gests a genetic determinant, it is equally likely that children maintain 
their eating habits over the years: a juvenile trencherman continuing 
to be a trencherman and a child with a bird-like appetite continuing 
his avian behavior. Reynolds (’51) further demonstrated the existence 
of sibling similarities in the thickness of the tela adiposa. Again, while 
the possibility of a genetic determinant cannot be ignored, the fact that 
siblings share a common diet should not be discounted either. Under 
these circumstances the relative fat pattern may prove of maximum use 
both in studying constancy of type and in searching for genetically- 
determined variants in the tela subcutanea. 

Finally, it should be pointed out that the relative fat pattern does 
not replace the absolute fat pattern. The absolute fat pattern comes 
closest to what we perceive. The absolute fat pattern is affected by 
age, sex, and nutritional status, and it is more than likely that the 
absolute fat patterns of different races differ to an important degree. 
The relative fat pattern, on the other hand, is much more of an indi- 
vidual characteristic and, in adulthood at least, is seemingly much more 
resistant to change. Both aspects of fat patterning are important 
adjuncts to research on the subcutaneous fat. Absolute fat patterns 
obviously relate to nutritional studies. Relative fat patterns, on the 
other hand, afford an opportunity for the investigation of the indi- 
vidual and are far more likely to find their origin in the germ plasm 
than in the “ groaning board.” 


SUMMARY 


1. Individual differences in relative fat patterning were investigated 
after converting absolute measurements into their corresponding z trans- 
forms. In this way patterned differences due solely to the amount of 
subcutaneous fat were eliminated. 

2. The measure oz (the standard deviation of the individual’s own 
z scores for fat) was used to indicate the extent of relative fat variability. 
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In 81 adult males oz ranged from 0.29 to 1.45 with a mean of 0.61. 
It was suggested that oz could serve as an objective measure of fat 
“ dysplasia.” 

3. As tested in various ways, oz was not affected by age and only 
slightly, if at all, by the total amount of fat. 

4. Individual relative fat patterns were investigated in the z score 

plots of 40 men with above-median oz values. A number of character- 
istic relative fat patterns were found, some of them resembling patterns 
described in the literature. In 17 men there were distinct and nameable 
relative fat patterns. However, in 23 cases, the relative fat patterns 
were not susceptible to simple naming. 
5. The possibility that such relative fat patterns are permanent and 
resistant to changes in nutritional status was investigated by comparing 
the z score profile patterns of 13 young men during a period of feod 
restriction. 

6. Visually, the tendency of an individual to maintain his z score 
profile pattern was striking. Using a quantitative measure of pattern 
similarity (odz) this impression was confirmed. 

%. It was observed that the relative fat pattern is an individual 
characteristic, having some permanence and resistance to change unde~ 
nutritional stress, and that the relative fat pattern (in contrast to the 
absolute fat pattern) may be genetically determined. 
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1. INTRODUCTION 


HE purpose of this paper is to introduce the general, time stationary, 
ti continuous, Markov process with a finite number of states, as a 
model for the follow-up study of a disease. A special case of such a 
process was introduced by Fix and Neyman (751) as a model for the 
study of cancer or other disease. 

In models used up to the present time cure rates or death rates 
have generally been used to measure the effectiveness of a treatment. 
These rates may be too coarse in that they overlook other important 
characteristics of the disease or treatment. They may also be not quite 
relevant in that the disease may cause death only indirectly and only 
after many years, or else the treatment does not cure in the sense of 
the patient becoming free of at least the gross symptoms, but only 
retards the progress of the disease. This is the situation with many, 
if not most, of the degenerative diseases. Under these circumstances, a 
medical scientist will most likely incorporate intuitive and non-quan- 
titative measures into his appraisal of the treatment. 

To introduce other quantitative measures of a treatment, it is 
necessary to consider more elaborate models of the temporal progress of 
the disease than have hitherto been used. One such elaboration, where 
it is appr¢ priate to do so, is to define several states that a patient may 
be in. For example, in studying arthritis we might define the following 
states: 1, active arthritis; 2, quiescent; 3, death from arthritis; 4, lost 
from observation or death from other causes. This might be enlarged 
by replacing state 1 with several states representing a range of activity 


from active to quiescent. The exact number of states would depend on 
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the purposes of the investigation and the ability of the investigator to 
discriminate between the states. Given such a set of states the patient 
who enters the study is regarded as traversing a path: he enters the study 
in some initial state and from time to time transfers from one state to 
another and eventually leaves the study in some final state. It is assumed 
that the paths are determined by a set of stochastic or probabilistic 
laws, t. e., the laws specify the probability of any particular path, though 
not the path itself. With the aid of such a model the disease process is 
studied as a stochastic process. The new measures which it is hoped 
are useful are derived from certain characteristics of the stochastic 
process. One such characteristic is the probability that a patient who 
enters the study classified in one particular state will, after 7 years, 
leave the study in some other, or the same, particular state. Such a T 
year transition probability, as it is called, is a generalization of the 
more familiar 7 year survival rate. If there are k& states, then there are 
k? such 7 year transition probabilities. As a second characteristic we 
have, for all patients who enter the study in one particular state and 
leave the study after T years, the average time spent in each of the states. 
As an illustration of this second characteristic, suppose a drug is being 
tested for its power of inhibiting the activity of arthritis. Then the 
average length of time which a person spends in a less active state of 
arthritis, who originally received treatment in a more active state, might 
be a measure of the effectiveness of the drug. 

The chief statistical problem is to estimate the set of parameters 
which defines the Markov process, by means of a sample of paths. The 
sampling scheme considered in this paper is one which consists of 
periodic observations on patients, that is, if a patient enters the study 
at time fo, he is observed at times ty + #, tp + 2t, to + 3t, ete., and only 
the information concerning the state of the patient at these times is used 
to estimate the parameters of the process. 

In section 2 a brief description of the Markov process used as a model 
is given. In section 3 certain useful characteristics of the process are 
defined. These sections are included for the sake of completeness only. 
A more thorough treatment of the mathematical aspects of the model 
is given, for example, by Doob (53), or Feller (750) both of whom give 
extensive bibliographies. In section 4, the estimates of the parameters 
(called transition rates) of the model are given. Estimates of the 
characteristics are then derived from the estimates of the transition rates. 
Section 5 treats of the large sample distribution of the estimates, and the 
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next two sections, 6 and 7, of tests of hypotheses and tests of goodness 
of fit of the model, respectively. In section 8 the model is generalized 
to two contiguous study periods. In section 9 an example is given. 
The terminology used in this paper refers to disease processes. As far 
as the theory is concerned, this is only a matter of convenience and is 


not meant to imply any limitation to the study of disease alone. 


2. A BRIEF DESCRIPTION OF THE MODEL 


We assume that during the course of a follow-up study a patient is 
at every instant of time in one and only one of & states which we label 
S,,S2,- - -,S,. The follow-up study is of length 7, which means that 
each patient is under observation for a period of time T following his 
entry into the study. The actual starting times of the patients are 
ignored and the entry times all set equal to zero. The total period of 
the study is therefore from t¢=-0 to t—T, and the fact that a patient 
is assumed to be in one of the & states at each instant of this interval 
of time makes the process, by definition, a continuous one. 

The phrase, under observation, in the context of this paper, does not 
mean strictly physical observation; it means rather, counted as being 
in the study and capable of being classified in one of the & states at any 
instant of time, while in the study. By this definition, for example, a 
patient who dies during the study remains formally under observation 
until his time 7’ has elapsed, if death is one of the states. We remark 
also that the times t=0 to tT need not, in practice, represent the 
actual length of the follow-up study. These two times may, for example, 
represent years 0 to 5, or 6 to 10, or any interval of a 20 year follow-up 
study. 

We are concerned with certain events called transitions. <A transition 
is an event of the following form: a patient at time f, is in state S; and 
at time ¢, > ¢, is in state S;, where « and j may be equal or unequal. 
Nothing is said about the state of the patient at any other time. Asso- 
ciated with these events are probabilities. If a patient is in state S; 
at any time ¢, we assume that there exists a conditional probability 
pij(t) that at a later time ¢, + ¢ the patient will be in state S;, for 
174=1, ‘,k and OS4,St,+tST. The probabilities p,;(¢t) are 
called transition probabilities. We have defined the transition prob- 
ability to depend only on the difference (¢, +-¢) { t and not on 


the actual time points ¢, and ¢,-+ ¢; by definition this is the property 


of stationarity. When making statements about pj,;(¢) for all values of t 
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in the interval [0,7] we will also refer to pi(t) as a transition prob- 
ability, in order not to complicate the terminology too much, though 
strictly speaking, pi(t) should then be called a transition probability 
function. 

The additional assumptions that characterize our model are as 
follows. When ¢—0, 


2-1) oe 1 if t= 
(2.1) Pu) —= 49 it Goey" 
The transition probabilities are also continuous with respect to ¢ at 
t{—0, 1.¢., 
; 1 i=j 
2.2) ] i(t) = nah 
( am pu?) 1 1} 


The last assumption is that our process has the Markov property. 
Loosely speaking this means that the conditional probability of the 
transition from S; to S; in time ¢ does not depend on the history of the 
patient previous to the time he was in state S;. More precisely, if p,(0), 
i—1,---,k is the probability of being in state S; at t—0, and 
0<t,<ti<t;<---<t,<T are any finite set of time points, then 
the probability that a patient is in state S;, at t= 0, S;, at t= t,,---, &, 
at t=, is 

Pig(9) ici, (41) Pinig (tz — t1) > + * Pin-sin (tn — Ens). 


The set of transition probabilities p,;(¢) are conveniently assembled 
into a transition matrix P(t) = {pi(t)}. We observe that when ¢t —0, 
P(0) is the identity matrix. Matrix terminology is very convenient for 
our presentation and it will be used extensively. 

It can be shown (Doob *53, pp. 238-243) that there exists a unique 
matrix Q such that for O=t=T, 

9 ¢ 2 ; 2 t? 3 t8 
(2.3) P(t) I+ Q+ Po +teOat:-: 
symbolically. @ does not depend on ¢; its elements are the instantaneous 


transition rates, as may be seen from the relation 


1 
lim ; (P(t) —I) =Q. 


t—0 


The elements of Q will therefore be called transition rates and Q the 


transition rate matrix, or simply rate matrix. @ together with the 
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initial states of the patients determines the process, so the primary esti- 


mation problem is to estimate Q. 
The tra sition rates have the following properties: 


(2. 4) qi =" i 
*ts0 i=; 
k 
(%. 5) Sqj=0 i—1,---,k. 
j=1 


Any square matrix Q with the properties (2.4) and (2.5) is the rate 
matrix of some process (of the type we are concerned with). This is a 
useful property enabling us to manufacture processes at will. 

In conclusion, the most important conditions which the disease 
process must satisfy, at least approximately, in order for the model to 
apply, are the following: 


1. During the period [0,7] of the study, the patient is at any time 


in just one of a finite number of clearly defined states, S,,- - -. Sx. 
2. The probability that a patient in state S; (t—=1,---,k) at any 
time during the study, is subsequently in state S; (7 =1,:--.k) ata 


later time in the study, depends on the two states in question but not 
on the previous history of the patient. 

3. The probability that a patient in state S; at time ¢, is in state S, 
at a later time ¢, depends on the difference t,—?#,, but not on the 


initial time f¢,. 


CHARACTERISTICS OF THE DISEASE PROCESS 


Starting with a given rate matrix @, which may be an estimate or 
else theoretically given, we can derive certain characteristics of the 
process. 

a. From (2.3) we can determine the transition probabilities for any 
time interval, in particular for the total period [0,7], by letting t — T. 

b. It may sometimes be desirable to find the transition probabilities 
corresponding, not to the given Q, but to a rate matrix obtained from Q 
by changing the values of some of the elements of Q, while at the same 
time satisfying the restrictions (2.4) and (2.5). As an important 
particular case, suppose the state Ss is the state of being lost from 


) 


observation. The rates gis (t.—1,: - -,k;t+48), representing rates of 
transfer from the other states to S5, will affect the transition probabilities 
in a way which, we usually assume, has nothing to do with the disease. 
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By substituting 0 for these gy’s and altering the diagonal elements of 
Q to satisfy (2.5), a new transition matrix can be calculated for the 
hypothetical situation in which no patients are lost from observation. 
In general, if the transition rates gag, 775," * *,Qu» have been suppressed 
in this way, then the altered rate matrix will be called the net rate 
matrix and written Qggj7s|---\4», and its elements called net transition 
rates. The corresponding transition matrix will be called the net transi- 
tion matrix and written Pggj+sj.-\4,(t), and its elements called net 
transition probabilities. 

ec. From a given rate matrix another characteristic that can be 
derived is: for each pair of states S,, 8;, the expected time spent in state 
S; during the period [0,7,.], 0=7T,=T, for a patient who is initially 
in state S; This expectation will be noted by e,;(7.>); it is given by 


» To 
eij(To) -{ py(t)dt 
0 
where pi(t) is determined by Q from (2.3). The integral is the limit 
as N-»o and | t, —t,, || 0 of the approximating sum 
ud 
> Py (tn-1) pis (tn — tn-1) : (tn — bas) 
n=1 


where 0—t, <t, <<: **-< ig=Tp. 
The matrix of expectations 
e(T >) = {e;(To)} 1,j==1,- ‘ -,k 


may be computed using (2.3). We have 


oT, 
e(T.) = ( P(t)dt 
0 
*T. *T> T, {2 
(3.1) —{ 1dt+Q { idt+Q? f Le oe 
“ 0 7 0 «70 2! 
T.* T,* 


=—/T,+ or + Br 


The matrix e(7') will be called the residence matrix and its elements, 
residencies. If Q = Qag}.--|u» is substituted in (3.1), then the resulting 
residence matrix will be called the net residence matrix and written 
€ag|---|u»(T'o). Its elements will be called net residencies. 


It is clear from the above that the transition and residence matrices 
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depend entirely on Q. It is convenient therefore to estimate Q first and 
then to estimate the transition and residence matrices using the relations 
(2.3) and (3.1). 

4. ESTIMATION IN THE MODEL 


The estimate of Q is based on a series expansion of Q in terms of 
P(t). Writing P*(t) = P(t) —TI, (2.3) becomes 


(4.1) I + P*(t) ma. 


Proceeding heuristically we then write 


P*(t P*(t)) 
(4. 2) Vt = log + P*(t)) = P*(t) — ais 43 2 —— 
~ vo 


In appendix 1 it is proved that (4.1) and (4.2) are equivalent provided 
the series in (4.2) converges elementwise. A necessary and sufficient 
condition that the series converges is that the roots of P*(t) are all 
less than one in absolute value (Wedderburn, 34). This condition is 
not very convenient to apply in practice as it may involve considerable 
computation. For our purposes the following condition is more useful. 
If 

(4. 4) pu(t) > 4 for t—1,---,k 


then the series converges. The proof is given in appendix 2. 


of time, starting 


, 


Let us suppose now that at periodic intervals ¢ 
from their respective times of entry into the study, patients are classified 
into one of the states S,,- --,S,. If a particular patient is classified 
s times in this way, his observed path will consist of the sequence of 
states S;,,Si,,- - -,Si,, where S;, is the initial state, S; the state after 


time ?¢’ has elapsed, ete. The likelihood of this path is 


TinPici,(t’) Ping (l’)~ > > pi,se,(t’) 


where z;, is the probability of the patient entering the study in state S,, 
while the remaining probabilities correspond to the succeeding transi- 
tions. Assuming that patients behave independently, the likelihood of 
the observed paths of n patients, after rearranging, is therefore 


pis( ts 


1 


k 
(4.5) L =[[ x" 
#=1 t 


k 
J 


where n; is the number of patients entering in state S;, ny is the total 
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number of transitions S;—>S; and n=n,+---+-++ nx. In order that 
the likelihood not vanish we define m"*=—1 if nj =O and similarly we 
define p,(t’)"’ —=1 if n;—0. A straightforward computation yields the 
maximum likelihood estimate of pj;(t’): 


Nis 
Mine + Mx 


We observe that the z;’s do not enter, so that the estimating procedure 
is independent of the particular scheme of entry of the patients. We 
also observe that in order for j,;(t’) to be defined, the denominator in 
(4.6) must not vanish. The interpretation of this fact is that some- 
where in the course of the follow-up study patients must be observed to 





(4. 6) pil’) 


be in each of the designated states. 

In estimating Q we consider first the case that there are no a priort 
restrictions on the elements of Q, and second that there are restrictions 
of the form q 0 for certain of the transition rates. These restrictions 
would occur when it is known that the corresponding transitions S;— S, 


cannot occur. 
Case 1. Noa priori restrictions on the rates. Letting 
P(U’) = {pi(t’)} and P*(t’)—P(t’)—I, 
the estimate of Q is taken to be 


(4.7) Q - 5 [P*(r')— 4(P*(t’))? + 4(P*(t’))8 —- +] 

provided the series converges. If the series converges, Q is the unique 
maximum likelihood estimates of Q. It is not certain, however, that 
the series will converge even if the roots of P*(t’) are less than one in 
absolute value, since P*(t’) is a sample matrix and some of its roots may, 
by chance, exceed one. However, the probability of the series not con- 
verging can be made arbitrarily small by choosing ¢’ sufficiently small. 
Condition (2.2) states that the parameters pu(t’) converge to 1 as t’ 0, 
and the ;,;(¢’)’s, being maximum likelihood estimates of the pi(t’)’s, 
converge in probability to the latter. Hence, a practical solution is to 
choose ¢’ so that the p,(t’)’s are (sufficiently) greater that $. This will 
make the probability that any of the pi(t’)’s exceed $ (and the series 


possibly not converging thereby) small. Some a priort information as 
to the magnitudes of the py(?)’s as functions of ¢ is needed in order to 
choose ¢’ suitably. In this paper we will assume that the series in (4. 7) 


does converge. 
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Case 2. Some of the parameters qi; are known to be zero. In this 
case we take as our estimate of Q a modification of Y. First Q is com- 
puted by (4.7). Then if gag is known to be 0, Gag is replaced by 0 
and daa altered so that (2.5) is satisfied. The same is done for all 


gas’s known to be 0. Denote the altered matrix by Q. Usually the 


elements of Q@ and ( will differ only slightly. However, if the size 


aR 


of ( is large and there are enough zeros in it, in the right places, then 
ae 

the subsequent computations involving powers of Q will be considerably 

lightened. 


A trial caleulation is sufficient to show that Q is not, in general 
ikelihood estimate of Q, subject to the given restrictions. 


> ime 


~~ 4 


the maximum 

The estimate Q has been chosen because it is the simplest reasonable 

estimate in sight. It can be shown that as t’—>0 the relative asymptotic 

efficiency of the estimate q and the maximum likelihood estimate of qi; 
a 

converges to 1, for each element OF of @. Therefore, at least for ¢’ small, 

the possible loss of efficiency in using 0 instead of the maximum likeli- 


hood estimate will be small. 


On the other hand, 0 and QO are both consistent estimates of Q for 
case 2, so that the large sample distributions of the same rational 
functions (e.g., pi(T) and ;(T) defined below) of Q and @Q are the 
same. This follows from the theorems of sections 20.6 and 33.3 of 
Cramer’s (°46) book. We are justified therefore in neglecting the dis- 
tinction between 0 and 0 when treating of the large sample distribution 
properties of such rational functions. We will, in fact, use the nota- 
tion 0 for either case 1 or case 2 estimates. 

Starting with the estimate 0 of Q, the estimate of P(t) for any f, 
0=t=T, is defined to be 
(4. 8) P(t) —14+-6t+@L+ae 

| ide 
obtained by substituting 0 into (2.3). Similarly substituting Q into 


(3.1) we obtain the estimate 


(4. 9) T,) «IT, 4+-0—*. 4- O° +- 


of e(7,). If some of the transition rates in Q are suppressed giving an 


estimated net rate matrix of the form Qggj..-\,,, then the corresponding 
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net transition and net residence matrices are defined similarly. The 
suppression of rates in 0 to obtain Qub\. iu» Should not be confused with 
the suppression of rates in 0 to obtain 0, the case 2 estimate. The latter 
is an estimate of Q, while Qag|---j~» 18 not. 

It will be observed, in forming the matrix products (P*(t’))" and Qn 
for n= 1,2,: -- that the sums of the elements of the rows of these 
powers equals 0. This provides a convenient check in computing. 


THE LARGE SAMPLE DISTRIBUTION OF THE ESTIMATES. 


The joint asymptotic distribution ot the dij 8 will be found first. 
As explained in the previous section, it is sufficient to consider only 
the case 1 estimates. The only restrictions on these estimates are that 


-gi= DS jj for i—1,---,k. Therefore, to avoid a singular dis- 


tribution we consider the jo int distribution only of the gij’s for 1). 
The distribution of the ¢;;s can be found, if necessary, from the dis- 
tribution of the @;,;’s, i j. The ij 8 for i) satisfy the regularity 
conditions of section 33.3 of Cramer’s (746) book; hence we may con- 
clude that these estimates 


1. converge in probability to the corresponding elements of Q, 


2. are jointly asymptotically normally distributed with the asymp- 
totic covariance matrix equal to the inverse of the matrix 


u log L 
{_ (= « -)t a, B,y,8—=1,°--,k; a By KB 


where € denotes “‘ expected value of.” 


From (4.5) 


a log L Ke 1 (opult’)* 
iti << jj — [ - ( ©) 
( ( 09ap™ ) i i= j rer (n u) pit’ )? Odap 


1 0? pit’ >) 
+540 ( up ]. 


To calculate E(n,;), first consider the path of a single patient, starting 
from time 0 in the initial state and classified subsequently at times 
t’,2t’,- - -,st?/—=T the length of the study period. Let zg be the 
number of transitions from state S; at time (a—1)? to state §; at 
time at’. Then the expected number of transitions 8,8, for the 








100 SAMUEL ZAHL 


whole path is €(7,+2.+:---+a,) —€(z,)+--+:-+€(a,). Now 
E(x.) =pr((a—1)t)p(’) a= + - *,8 


if S, is the initial state. Therefore 


€(z,+:---+2,)—> Poi( (a — 1)t’)pi(’). 


If there are n, patients with the initial state S, for b—1,---,k and 
ni; is the total number of transitions 8; S, for all the n = § ny patients, 
then 


k 
E(nyj) = S ny > po((a— 1)t’ pil’). 


b=1 a=1 


Substituting in (5.1), 


07 log L \ e-3 
me pf See Ce ee 
(Fre) ~~ EOL Em Zoncar | 
[ 3 re) A (eOy7 
pis (tv) Jap , pit’) Oqap* ) 
— =[ =m: ny & Poi (at’) wth i(f >) ] 
j =F O7Fap 


An(t’)\2 
>> > pri (at’) Ee) ] 5 
b { Pij( (t’ ) 0dag ‘ 


, 
If ¢’ is sufficiently small while 7 remains fixed, this last expression 
can be approximated by one of simpler form, for we have 

















I 








lim ¢’ 5 > Pri(at’) = en (T) 


t’-0 bar 





s¢°=7 
4 we 
. \\ 2 if ia 
lim & 7 a ( Pu ’) _ 4 Jas 
_— j © Pi Odap 0 oe 


using |’Hospital’s rule for the second limit. Therefore 


; Vlog L 1 
(5.3) lim e(‘ = “) S nena T). 
Joe 0G ap” Tap »b 
st’= 


The expressions (5.2) and (5.3) have been calculated for two 
systems of transition rates and for several values of ¢’ in order to give 
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some idea of the effect of ¢’ on the accuracy of the approximation (5.3). 
The first system is one used as an example by Fix and Neyman. This 
system has four states with the rate matrix 


eT 1.0 2.7% O 

0 0 0 0 
= 2 CC — £ 2 
0 0 0 0 


The interpretation of the states is of no consequence for this illustration. 
For this system it is assumed that all patients have the initial state 8). 

The second system was chosen arbitrarily and consists of three 
states with the rate matrix 


(— 65 45 .20 
Q=! 1.20 —1.97 UY 
| = 33 68 —1.01 


It is assumed that the initial states are divided equally among the three 
states. 


TABLE 1 


Asymptotic Variances of Estimates of Transition Rates for Two Systems 
of True Values of Rates 





PROPORTION t’ OF TOTAL PERIOD OF OBSERVATION 7' 

















TRUE RATES BETWEEN SUCCESSIVE CLASSIFICATIONS 
t’ —_ VT y = WoT ¥ — ooT t'> 0 
7 dia = 1.0 3.59/n 3.57/n 3.56/n 3.56/n 
System* qs = 2.7 10.30/n 9.92/n 9.74/n 9.60/n 
I da= 2 .60/n 52/n AT/n 42/n 
Gan 2 42/n 42/n A2/n 42/n 
Gu = .45 42/n 38 /n 35 /n 
é.2 30 19/n 17/n .16/n 
System** Gu = 1.20 1.72/n 1.63/n 1.54/n 
II Ga 28 1.13/n 1.06/n 99/n 
Ga == 33 42/n .38/n .35/n 


a2 .68 .83/n AT /n .72/n 





* For system I, n is the number of patients with initial state 8,, the only 
initial state. 
** For system IT, equal numbers mn of patients have the initial states 8,, 8,, 


and §,. 








equal to gag 


in this normal! distribution 


Estimat 
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find, similarly, 


asymptotic covariances 


(0 log L = 

(Seca )-3 i=L2 por 

DB; re a lere) 
i, (t ) dqag Oqys 


=) as f->0 


a 


, $30, and st’ =—T. 

( d that oO m iar il I’ sma the asS ior @ B are 

t rn ndependently distributed with the mean of 
and the variance equal to 


_ Was 
> Nreva(T’) 


i) 


of (5.2), (5.4) and (5.5) are obtained by substituting 
} — — ; , ' a 
sample estimates for the corresponding parameters. For example, in 
: ; On;;(t’) 
5 ® if , j ‘ a | i+yy4+ 1 fan 
( Pa } b (at -P (Cf ) ind . are su stituted LOT Poi 
0Gas 
Op (t’) 
and - 


0 708 
Turnin 
these estima 
mean of pi (t) 


(5.6) 


where tl} 


and (n) 
n 
interval petween 
(5. 7) 
The asymptotic me 
asymp OUl Varlal 
(5.8) 


respectively. 


g now to p 


es : 


he deriva 


(at’) > py(t’), 


t) and @,,(t) for 0 
ymptotically no 


(¢) and the asymptoti , 


-¢{=T, it can be shown that 
rmally distributed. 


a> 


The asymptotic 


rian 


e is 
1 < fapilt)\ (Api) 
x > f C’ apys(n) 
MN , £8 079ap 0q, 
tives are evaluated at the true values of gag, q+, ete. 
Aa svm eas ; of dea and For ?’ (1 
successive observations) small this is approximately 
z Op (t) 2 7ap 
a+p 0Fap >> Neral T) 
) 
in of é;(¢) is e(t), while the exact and approximate 
s, correspo iding to (5.6) and (5.7), are 
] = ‘ de;,(t) 0e; (t) 
2 21; aby8() 
nm ax~pyx+o dag 0qy5 
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and 


: de;,(t) \? Jas 
(3.9) > — ys 7am ° 
a+p 0dap how Nvevadl') 
d 
The proof is deferred to appendix 3. Estimates of (5.6), (5.7), (5.8), 
and (5.9) are found by substituting gag for gag throughout. 
In computing the derivatives in the preceding formulas, the matrices 
§ dpis(t) | ( de;,(t)) 
( Oqap $”° | Oqag J 


can be computed as follows: For any matrix A = {aj(xz)} define the 
derivative of A with respect to x as the matrix of the derivatives of its 


OA f da (r) } 
OL / Or \ ; 


00 
t 
07ap 


A,=44,Qt + $QtAy 


()-t- 
A 44.0t } } — A, 
Vt 
A,= +A Vt T (SP) a, 
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6. TESTS OF HYPOTHESES 


In this section we are concerned with two processes, having the same 
sets of states but with possibly different rate (and hence different 
transition and residence) matrices. We shall distinguish between the 


two processes, in our notation, by means of the superscripts “(1)”, and 


uestions that might arise in connection with a follow-up 
study are: 

1. Are the rate matrices of two periods the same or not? Such a 
question might arise in deciding whether to combine two adjacent periods 
into a single one of length equal to the sum of the two original periods. 
Or it might arise in deciding whether to combine two sets of data, 
possibly collected at different places, representing parallel studies. 

2. If p(T) and p(T) are transition probabilities for the period 
[0,7], corresponding to two different treatments (and hence two dif- 
ferent processes), are they the same? 


3. The same question as the previous one with respect to e“(T) 


Considering question one first, an approximate large sample test of 
the null hypothesis that the rates gag and qag® with a are equal, 


against the alternative hypothesis that qap’? ~ qag", is based on the 


Yaf " Gag 


\ a (Gap )) sk jap = 


where o7(¢a8") and o7( (ag) are (estimates of) the asymptotic variances 
of gaa and «, espectively. These are given by (5.2), or (5.3) if 


t 1s small. From section 5 it follows that U as is distributed approxi- 


] +] last . , f -# 3 x mi : . 
mately as the absolute value of a unit normal variate. Che hypothesis 
of the equality « ind dq is rejected for a large value of Uag 

. i > . ~ 
(depending on the significance level). It will be noted that T@ need 
not er 1a] ] 
For ¢ small we can also test the equality of several rates jointly. 
Let the null hypothesis be that 
—— 1) ; "ee 
Ya,8 —= Ja:B,' » Vacbs GacsBs ‘> > Ya,B; Ya,B 


where 4,8; for 1—1,- - -,p and the (a, 8;)’s are distinct, and the 


alternative hypothesis be that at least one of the equalities fails to hold. 
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Since the gag’s are asymptotically independent as no and t—>0, it 


follows that for large n and small 


p 
(6.2) wap.” = Xo" 
t=1 


is distributed approximately as x? with p degrees of freedom. In par- 
ticular, we can test the hypothesis that Q —Q© against the alternative 
that Q® ~Q® by summing (6.2) over all (a,8)’s for which «8. 
The hypothesis of equality is rejected for a large value of xp? (depending 
on the significance level). 

The significance levels of the tests based on (6.1) and (6.2) should 
not be taken too literally, as the tests are based on several assumptions: 
the normal distributions assumed for the gag’s are asymptotic, not exact ; 
the variances in the denominator of (6.1) are estimates, not exact 
expressions ; and unless t’ = 0, the gag’s are only approximately indepen- 
dent, asymptotically. Nothing much more specific can be said about 
the effect on the significance level without inquiring more deeply into 
the distributions of the statistics (6.1) and (6.2), or conducting some 
sampling experiments. 

With respect to the elements of the transition and residence matrices 
for 7) = T° —T, statistics similar to (6.1) can be used for large 
sample tests of hypotheses of the form pas(T) =—pag™(T) or 
éag’) (T) = eas (T) for a, against either one or two sided alter- 
natives of inequality. For example, to test the hypothesis that 
Pas (T) = pas (T) against the alternative that pag™(T) >pas®(T) 


we may use the statistic 


" 7 ba(T) — fap®(T) 
ai = Vo*(pas(T)) + 0%(pap(T))” 
which is approximately normally distributed with zero mean and unit 
variance, for large n. The hypothesis of equality is rejected for large 
positive values of V ap. Here again o*(pas(T)) and o°( pap (T)) are 
(estimates of) the asymptotic variances of pag(T) and pag’(T’) respec- 
tively, and these variances are given by expressions of the form (5.6), 
or (5.7) if # is small. 


7. TESTS OF GOODNESS OF FIT OF THE MODEL 


We have so far dealt with tests of hypotheses assuming that the 
model fits the disease process adequately. There are certain internal 
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checks (i. e., checks based on the data alone) of the goodness of fit of 
the model that can be made. These checks are of conditions 2 and 3 
of section 2. 

Condition 2 is: the probability that a patient in state S; at any 
time during the study, is subsequently in state S; at a later time in the 
study, depends on the two states in question but not on the previous 


} 


history of the patient. As an alternative to condition 2, we may take 


the hypothesis: if a patient at times ¢., and ¢, (¢., < ¢)) is in states S, 


and Sj, respectively, then the probability of his being in state S; at time 
t, (to < t,;¢.4, to, ¢,; any times in the period of observation) depends on 
S, as well as on S; and S;, for at least one (triple S,,8;,8;) of states. 
A test of condition 2 against this alternative is the following. 
Assuming that the interval between successive classifications of n patients 


am ff } t] numbey f trancit g @ ry p 
18 , bE t)¢ tne hnumoder Ol ranslvtions « — sO Ss. [herefore 


> Nraij= ni. If condition 2 is satisfied, then for each pair (1,7), 


for at least one pair ( 1.7). the expected values of py;;(t’) for h + k 
will not be the same. Accordingly, we may use a y? test of homogeneity 
of proportions. It will be observed that if, and only if, gq is zero, then 
the proportions (7.1) are either identical y one or zero tor all samples. 


Letting F be the set of indices i for which g, is not a prvort zero, we have 


* ~ En, ) 
(7.2) "7 7 3s 
—_ -~_- -_- % 
it j cil 


% 


If FE has m elements, then n, the degrees of freedom, equals 
m|(k 1)n (ki: 1)]. A large value « \ renders condition 2 
suspect. 

The distribution of y Is asvymptot 1 \ icu-e. For finite n 


the approximation is poor if any of the maj;s or (Snqij)’s are small. 


If any of these quantities are small, it will be better to partition the 


ns into fewer than m parts. Suppose we partition the set E£ into 
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b<m subsets, L,,H#.,---,H». Define ng; as the number of transi- 
tions Sg,—S;—>8S;, where «eH,. The formulae of (7.2) are then 


Va 


replaced by 
(7.3) wa z > 


(nE,4j — Enp,ij)” 








h=1 4e8 j=1 Engyyj 
( > Np, ij) ( 2 Np,ij) 
Etay — 2 —_ 3. 
= > news 
h=1 f= 


while v = m|[(kK—1)b —(k—1)]. 

However, it may happen that no matter how the nj;’s are partitioned 

there will be a sum Snp,i; or an expectation Eng; that is too small. 

h=1 

In this ease, an alternative test can sometimes be used. The indices 
1,- - -,k are partitioned into d subsets (X= dk), F,, F.,- +--+, Fa and 
np; Gefined to be the number of transitions Sg—S;—8S; where 
a2eF,. The matrix {nj} is thus partitioned into d matrices {np,g}, 
(hk =1,:--,d), from which are calculated matrices Q,, (hk —1,- - -,d). 
The Q,’s are then tested for equality pairwise by the test of the pre- 
vious section. This check involves considerable computation for d > 2, 
however. 

If condition 3 is violated, one way in which it is likely to be violated 
is that some of the elements of Y undergo an increasing or decreasing 
trend with time. The ordinary life table regarded as a two state 
process with the states “alive” and “dead” is an example of a process 
which violates condition 3 in this manner, if we take as the total period 
of observation the first five years. Here the transition probability from 
the state “alive” to the state “dead” for one year decreases consider- 
ably from the first to the last year. The condition is better met for 
five vears in the middle of the table. As a test of condition 3 against 
the alternative of a trend, the total period of the study can be divided 
in half and n,™ and nj computed for the first and second halves 
respectively for te F and 7 =1,- - -,k, where EF is defined above. Again, 
Wwe use a x test of homogeneity with 


2 ( 
- cry Stu ens? 
h=1 ieE j=1 Enis! (») 
k 
(21 i) (Sr ) 
Eny™ =? as 
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Here w= m|(k—1)2—(k—1)]. Alternatively, we can compute Q® 
and () for the first and second halves respectively and test these 
matrices for equality. 

What do we do if these tests show that condition 2 or 3 is apparently 
violated ? 

If condition 3 is violated, the total period can be divided into sub- 
periods small enough for the rate matrices of each sub-period to satisfy 
condition 3 approximately. The method of section 8 is then used to 
join the sub-periods together. 

If condition 2 is violated, it may be possible to substitute a higher 
order Markov process, ¢.e., one in which the transition probabilities are 
determined by two or more preceding states. The generalization of the 
method of this paper to such a higher order model is straightforward, 
but will not be done here. 


8. TREATMENT OF TWO CONTIGUOUS STUDY PERIODS 


Assume that there are & states for each period and that the first 
period is from t—0 to t==-T, and the second from tT, to t—T). 
For the first period, let Q, be the rate matrix, 


P,(t) ={pyM(t)} O<tS<T, 
the corresponding transition matrix, and 
4 
(8.1) a(t) —{eu(t)} — J P(t)dt O<U<T, 
0 
the residence matrix. For the second period, let the corresponding 
matrices be Q., 
P(t —T,) = {pis (+ T,)} 1. Stay, 
and 
»t’ 
(8.2) e,(t’—T) = {e, (t’ —T,)} = P,(t—T,)dt 
« 7; 
7:2f°S7,. 
It follows that if at time 0 an individual is in state §,, the prob- 
ability p(t) of being in state S; at time ¢ is 
pyM(t) OStST, 
p(t) =< & . : 
D> pia (11) pas (t — T;) 7.63 te. 


| a=1 
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In matrix notation, this is 

P,(T) 0OStsT, 
(8.3) P(t) = 

P,(T,)P.(t—T,) 7. StsTte. 


In terms of Y, and Q2, 


T+Qt+ Qt te 0<t<T, 
(8.4) PC) =} T+ 1Q7, eed: 
| + (QT: + Qt—Ti)] 5 + -. T,<t<T,. 


From (8.1), (8.2) and (8.3), we have, for 7, =? =T,, 


(8.5) e(t’) = {Pa fPwas fi P(t)dt 


-f" P, (t) yat + Pa(t) ff Ps P.(t—T,)dt 


=e,(T,) + Pi(T,)e.(t’ —T;). 


As in the case of a single period of study, these formulae hold also 
for the estimates 0:, Q>, P, (t),- - - or net estimates 


0: af |-y5|...» Qs of|75|.... P1-a8|79]...(¢), ete. 

The results of section 4 concerning the large sample distribution of 
the j(t)’s and é,;(¢)’s extend without any difficulty to the present case, 
and we may state that for any pair of indices 1,7 1,- - -,k, p(t), 
T, tT, is asymptotically normally distributed with asymptotic 
mean pij(¢) and asymptotic variance 


l pil t)\ ( Opis t 
(8.6) -~>> (se) (Fes) ‘aps (n@) 


W afB #5 0qas™ 
1 dpis(t) (Pet) c 
a : (2) (m() 
? n) 2, 2, (25 TN Oqys Capo (n®). 
For ¢t/ small we may use approximations of the form (5.7). The 
formula (8.6) is the generalization of (5.6). 
The matrix 
@P(t) 
dq op” 
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can be found from (5.11) using the relations 


0P(t) a 6P,(7 D P(t — T.) 
Oqap’”’ Ogag*” 


éP(t) éP.(t —T',) 


> ry? - 
ay = Pi(T1) rc 
Ody a 0g <) 
derived from (8.3). 
Similarly é,;(¢) for T, StS T, and i,j —1,- - -,k is asymptotically 


normally distributed with asymptotic mean e,;(¢) and with the asymp- 
totic variance given by an expression of the same form as (8.6) but 
with e;(t) replacing pi(t). The same remark as before applies if ¢’ 
is small. Furthermore 
de(t) 
Oqap” 
| 


is computed using (5.11) and (5.12) and the relations 


de(t) _ de, (t) . 6P,(T,) 


i eh e(t—T 
Oqas” Ogag'” 0qap* " ) 
je(t) _ belt —T 
uP it.) 
Oqya”’ dqys” 


derived from (8.5). 
An estimate of (8.6) is obtained in the usual way by substituting 


estimates of the p’s and q’s for these parameters throughout. 


9. EXAMPLE AND COMPARISON OF FIX AND NEYMAN’S 
AND OUR ESTIMATES 


We consider in this section a single period only. It was not possible 
to obtain any actual data with which to illustrate the estimates of this 
paper. Instead, samples were generated artificially based on a system 
of parameters (already given in section 5) used by Fix and Neyman 
(751). To generate the samples, the Fisher and Yates (’49) tables of 
random digits, and the Rand Corporation series of random digits 
appearing in recent issues of the Journal of the American Statistical 
Association were used. 

The samples so obtained enable us to compare the estimates of Fix 
and Neyman with ours to give an indication of the relative performances 
of the estimates. As a basis for comparison, the sample variances of 
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the estimates of the risks, for the two estimates in question, have been 
used. As the estimates of this paper are based on somewhat more 
information than Fix and Neyman’s estimates, it is to be expected that 
the sample variances of our estimates will tend to be less. The question 
of greater interest is how much less; unfortunately the present experi- 
ment is too small to answer this question. 

The example that we use has the rate matrix 


—3.7 10 27 O 
0 0 0 0 


Briefly, the interpretation is: 


S,: active cancer 


TR 


: death from cancer or operative death 
S,: cured or apparently well 


S,: lost from observation or death from other causes. 


The a priort assumptions are that q:4 = 22 = Jae = as = 0. Letting 
the total period of observation equal 1, and observing the “ patients” 
at intervals of 1/10, the true transition matrix for the interval 1/10 is, 
by (2.3), 

( .693 084 2k 002 ) 

, 10 1 0 0 | 

P(W/10)—= |) 16 001.963 .020 

LO 0 0 1 
On the basis of P(1/10), a total of 3000 paths were generated, half 
with the initial state S, and half with the initial state S;. For example, 
the first four of these paths are: 


1113 3 38 38 8 3 8 
1133 344 4 4 4 
113 3 3 383 3 3 3 3 
ae oe oe oe ee oe oe ae oe 


The 3000 paths were divided up into 20 groups. The first ten groups 
each contained 50 paths with initial state 8, and 50 with initial state S, ; 
the second 10 groups each contained 100 paths with initial state S, and 
100 with initial state S,. The paths were subdivided into these groups 
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according to their order of generation. Following the notation of section 
4, table 2 gives the numbers of transitions S;—>S; for each sample. It 
will be observed that the remaining nj;'s are a priort 0 or 1. 

The a priori restrictions on Q are such that gis, gis, Gai, and ag 
determine (, and therefore it is sutlicient to give the estimates of these 


parameters only. Our estimates of these parameters are based on (4.7), 
TABLE 2 


Numbers of Transitions in Twenty Samples 


SAMPLE Nyy Ny» This his No Ngo Nss Nas 
l 124 16 37 0 10 0 638 13 
2 165 13 40 0 8 l 636 1] 
3 142 15 40 0 Ss l 611 15 
4 137 22 29 0 5 l o9T 1] 
5 140 13 36 0 6 0 620 16 
6 126 12 40 l 6 l 666 15 
7 114 12 37 0 6 0 642 16 
8 153 13 43 0 12 0 609 15 
9 110 16 36 ] 4 0 619 15 

10 140 16 37 0 10 2 609 17 
ll 238 $4 66 2 19 l 1170 33 
12 236 338 79 0 23 0 1246 24 
13 231 24 SY l 23 z 1339 20 
14 235 31 78 l 16 0 si3 29 
15 27 35 81 0 26 l 1241 24 
16 288 25 83 2 24 l 1300 22 
17 232 20 85 U 13 l 1303 35 
ls 239 34 74 2 19 l 1286 22 
19 293 28 SO | 20 fT) 1286 24 
20 255 26 81 l 15 2 1264 30 


while Fix and Neyman’s estimates are based on the formulae of page 
230 of their paper. It will be noted that their state S; is our state S;,, 
for 1—0,1,2,3. The estimates for the 20 samples are given in table 3. 
Fix and Neyman’s estimates for samples 6 and 12 are infinite and 
have therefore been omitted. 

In table 4 are given the sample variances of the transition rates 
based on the two groups of samples, 1 to 10, and 11 to 20. It will be 
seen from table 4 that the estimates of this paper, with one exception, 
have the smaller variances. 


In practice, one would also be interested in the matrices P(1) and 


e(1), and therefore in the variances of the estimates of these two 
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matrices. It can be shown, however, that the variances of the pi(1)’s 
and é;;(1)’s are approximately equal to linear combinations, with positive 
coefficients, of the variances of the gjj’s, so that for an indication of the 
performance of the estimates in general, it is sufficient to examine the 


sample variances of the dij 8 
TABLE 3 


Estimates of Transition Rates: (A) Ours, (B) Fixw and Neyman’s 


ESTIMATES OF 


Qs 1.0 Gn = 3.7 qu = 2 qu = 2 

SAMPLE* A B A B A B A B 
] 1.08 1.06 2.54 2.21 18 17 20 13 
2 68 79 2.14 2.04 14 00 17 22 
3 89 1,20 2.43 2.71 15 05 24 38 
4 1.36 1.34 1.83 1.88 10 .08 18 14 
5 80 67 2.25 1.64 a .06 25 16 
6 .79 - 2.70 — 11 — 22 — 
7 88 72 2.75 2.10 11 15 26 13 
8 73 96 2.45 2.95 23 AT 24 04 
9 1.19 1.28 2.72 2.63 08 .00 24 36 
10 97 94 2.30 2.28 19 35 27 01 
1] 1.52 1.37 2.33 2.13 19 21 28 29 
12 1.31 2.78 — 22 — 19 — 
13 84 1.02 3.19 3.58 21 12 15 .20 
14 1.08 1.12 2.78 2.79 15 21 22 16 
15 1.08 1.40 2.55 3.2) .24 .25 19 lé 
16 74 84 2.49 3.07 21 44 17 —.02 
17 71 60 3.09 2.62 12 04 .26 BR 
18 1.17 1.05 2.59 2.49 18 18 AT ll 
19 Sl 84 2.37 2.38 18 13 18 23 
20 B85 1.00 2.70 2.91 14 10 23 19 


*Samples 1-10 consist of 100 paths, half with initial state S, and half with 


initial state S;. Samples 11-20 have twice as many paths in each category. 


Besides P(1) and e(1) the other matrices of interest in this example 
are P.,(1) and e,,(1), which are the transition and residence matrices 
with the “ inessential ” risk g,, suppressed. The estimates of P(1), e(1), 
P,,(1), and e,,(1) have been computed from (2.3) and (3.1), only 
for samples 1 and 10, and only for the estimates of this paper. These 
estimates are given in table 5. Together with estimates of their variances 
obtained from (5.7) and (5.9), they are examples of solutions of the 
estimation problem in our model. 
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TABLE 4 


Sample Variances about True Values, of Estimates of Transition Rates, Based on 


Two Sample Sizes of (A) Our Estimates and (B) Fix and Neyman’s Estimates 


SAMPLE SAMPLE SIZE rOTAL NUMBER 
VARIANCE NUMBER OF PATHS OF 
OF ESTIMATE IN EACH SAMPLE SAMPLES 
100 200 
a A 046 064 10 
q; 
; B 052 058 9g 
™ A .160 073 10 
dis 
B 340 192 9 
. A 006 002 10 
qa 
B 026 012 9 
- A 002 002 10 
qs 
F B O15 010 9 


TABLE 5 
True Values and Our Estimates of P(1), e(1), Py(1), and ey (1) 


WITH RATE gd, SUPPRESSED 


‘ 
ESTIMATE SAMPLE SAMPLE ESTIMATE SAMPLE SAMPLE 
OF l 10 OF * l 10 
Pull 054 .053 .043 Puw(1) 036 057 .047 
Pr(1 281 307 406 Pro. (1) 324 .308 .408 
Pig (1) 570 549 444 Prs.c (1) .641 .636 545 
Py(1) 095 091 106 
Pal 042 040 .036 Pu.s(1) O47 .046 .045 
Ds (1 035 035 .048 Pes. (1) 086 0338 054 
on (h) = .75) 152 .687 Pes. (1) 866 916 902 
Py (1) 172 173 .228 
ey, (1 281 .285 .268 Cerca ( 2) .162 286 .269 
€,,(1 .205 223 301 13.04 (1) 2509 224 302 
€:3(1 AT4 453 38u €13 0 (1) 583 490 429 
€,,(1 040 038 .046 
Cx: (1 035 .033 .032 Cus os (1) 043 .035 035 
Ca (1 015 O15 021 ss. 04 (1) 039 .016 .022 
€s3 ( 1 860 862 827 Genes (1) 918 .949 942 
ey (1 090 090 121 


£ 


* piyu(1) is the ij-th element of P,,(1); similarly for the residence matrix. 
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In table 5 we see that sample 1 happened to yield estimates that are 
quite close to the true values, whereas sample 10 with twice as many 
observations has not done as well. In practice not all of the estimates 
in such a table as 5 may have a useful interpretation. In our case, 
€;.(1), for example, would have little practical significance. 

It may be of interest to know that to compute P(1/10), Q, P(1), 
e(1), Ps,(1), and e3,(1) for a single sample took about 6 hours. A 
4 state model with no a priori restrictions would have taken about twice 
as long. It took about 15 terms of the series (2.3) and (3.1) to secure 
convergence accurate to 5 decimal places. To compute @ took fewer 


terms. 
APPENDIX 1 


Theorem: If 


NOY 4. EOF... 
2 


Al. 1) Ot = P*(t) — 
( ‘ (t) 3 


converges (elementwise) then (A1.1) is equivalent to 


\1.2 P(t) 4+ +S 4 
(Al. #) (4) = reat eat . 


Proof: The coefficients of (A1.2) are the same as the coefficients 
of the power series representation of e7-—1—y: 


> 


r 


(A1.3) yrrt tat, 


while the coellicients of (Al.1) are the same as the coefficients of the 
power series representation of r log(1 +y), for | y | <1: 


(A1. 4) ganged 4 EW. « «, 
First assume (A1.2) is true. Substituting (A1.2) into (Al.1) we 

obtain on the right side of (Al. 1) the series 

(A1.5) Qt + a,(Qt)? +a,(Qt)®+- - - 


where @,, ds, - - are certain constants. The series (A1.5) converges by 
hypothesis. Substituting (A1.3) into (A1.4) we obtain on the right 
side of (Al. 4) the series 


(A1. 6) zt+aor*?+a,7°+::- 
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which has the identical coefficients as (Al.5). This follows from the 
fact that P*(t), Qt, x and y enter only as algebraic elements following 
the same multiplication and addition laws. However, since (A1.6) 
converges and equals z for an infinite number of values of 2, all the 
coefficients a,a3,- - - are zero. Therefore (A1.5) equals Qt and (A1.1) 
is true. The argument for the converse is the same given the fact that 
the exponential series converges for all values of the argument. 


APPENDIX 2 


A sufficient condition for the convergence (elementwise) of 


oo —— m+i 
(A2. 1) S Prony) 
is that 
(A2. 2) pu(t) > 4 for *t—1,:- -,k. 


Proof: Denote the tj’th element of P*™(t) by p*,™(t). (A2.2) is 
equivalent to S pia(t)< $4 for t—1,---,k; hence | p*,j(t)| <4 for 


at ' 
1,j7=1,---,k. Now assume that 


(A2. 3) p*y™(t)| <4 for t,j7—1,---,k 


for some value of n. By definition, 
k 
pris (t) = > p*ia(t)p*aj"(t) 
a=1 
= > Pia(t)p*as™(t) — p*iy(t) 
a 


= » Pia(t)p*as™(t) + (p(t) — 1)p*(t). 


ast 


By (A2.2) and (A2.3) the absolute values of each of the last two 
summands is less than 4, so that | p*,"*(t)| <4. Hence, by induction, 
(A2.3) holds for all n. 

Now, by (A2.2), there exists a 8>1 such that | &p*,(t)| < 4 for 
t=1,---,k. Substituting 8p*,,(t) for p*i,(t), for all t, 7, it follows 
from what has just been proved that 


3 
5” 


5" p*,™(t)| <i or p*y"" (t)| < 9 
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for i,j=1,---+,k and all n. Hence, 
> |p + mt) — <4 for i,j—=1,-- -,k. 
axt bow = 


The infinite sum on the right converges, and since absolute convergence 
implies convergence, it follows that (A2.1) converges. 


APPENDIX 3 


We give here the derivation of the asymptotic distributions of py(t) 
and ej (¢). 
Observe first that (5.4) has the form 


n 
n> — h(a, B, y, 8) = neagys(n) 
bd Np 


where n=n,-+- - --+ nm, and Cggya(n) depends on n but is bounded as 
n—a2. Therefore, the asymptotic covariance of das and dvs for a-~B 
and y+8 has the form 


Rs 
(A3. 1) — C'apys(n) 
7 


where c¢’ag,9(n) depends on n but is bounded. In particular ggg has the 
. - 1 , l , 2 
asymptotic variance — C’ggag(n) = — C’ag*(n). 
. n n 


Now expanding ;,;(¢) in a Taylor expansion about the qagg’s, we obtain 


_ fo n44( t) : 
(A3. 2) \ ny (b}- - pij(t)) = \ n> ( a Y (ius — dap) + V nR. 
a+§ Vas 


The remainder F# is the quadratic form 


‘ = Ay 0° y Ct) . 
(A3. 4) > f -! (Jag — = Jag) (G75 — Jy5) 
axpy x 7 Od’ pq’ + 
where the derivatives are evaluated at points q' af; @ 8” - + intermediate 
to dab and Jap: jv and Qy5,° °° respectively. 


We show first that nf converges in probability to zero. Since gag 


is oimeniean normally distributed, we have 


7 1 ; 
(A3.5) Pr{| Gas — Yap | = —= C’ap(n)u} > 2(1— ®(u)) 
Vn 
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where @(u) is the unit normal distribution function. Let « and 8 be 
given such that 1 >e>0, and 1>8>0. First choose u so large that 
8 


A3.5 2(1—® 
(A3. 5) (1—#(u)) <7) 


(where i is the size of the Q matrix). Then choose n’ such that for 
n>n’ all the inequalities 

i l_ , 
(A3. 7) Cag(n)u <e a B 


Wn 
are satisfied simultaneously. Finally, for each n, define Z, as the set 
of sample points for which all the inequalities 


2 “ ® 3» 
(A3.8) Yas — Yas < C ag(n)u a--B 


it 
are satisfied simultaneously, and define Z*, as the complementary set. 
It follows from (A3.5) and A3.6) that for all n greater than 


Ad 
some nm , Say, 


Pr(Z*,,) < k(k —1)- = § 
k(k 1) 


and hence Pr(Z,) = 1 6 for n>n”. But on Z, it follows from 
(A3. 7) and (A3.8) that for n> 7’, 


A a , , > 
Vn | Yas — Yap {v6 qy5 | C ap(Nje ys(n)ur 


i c’ag(n)c’,s(n)u* < €. 
Vn 
Since « and $ are arbitrarily close to zero, it follows that the term on 
the left converges in probability to zero, and hence YnR, a linear 
combination of such terms, also converges in probability to zero. 


Now consider Vil af dap \- We have 
Pr{ uM das . Yap | : Cal nju ; = & | l —  ( u)). 


As u- 0, 2(1 P(u)) — 1, so that Vn Jap — Jag does not converge 


in probability to zero. 


It follows, cancelling VY n, that () (¢) / (Z)) has the same asymp- 


a Opi t) ‘ 
> . (Yap - . Jag) - 
ap 0dap 


totic distribution as 





A MARKOV PROCESS MODEL 119 


This is a linear combination of asymptotically normal variables, hence 
is itself asymptotically normal, with the asymptotic mean 0 and asymp- 


totic veriance 
ij ij t s 
1 Ss s (Ree) € 19) apron). 


n a7p775\ Aap /\ Og 
For ¢’, (the interval between successive observations) sufficiently small, 
this is approximately 


> (2eut\" — — 
af Odag Dd Nveva(T) 
o 


The asymptotic distribution of the é,,(¢)’s is derived similarly. é,(t) 
is asymptotically normal with asymptotic mean ¢,(¢) and asypmtotic 


1 Ge;;(t) Gei;(t) , 
Ma ( me) c= )e abri(n). 


For ¢’ sulliciently small this is approximately 


dex(t)\? Jap 
=/( se) D> Nveva(T) 


b 


variance 


SUMMARY 


The general, time stationary, continuous, Markov process with a finite 
number of states is introduced as a model for a follow-up study. This 
model may prove useful in situations in which several states of the 
disease can be distinguished and are of clinical importance, as opposed 
to just two states, life, and death. A special case of this process was 
used by Fix and Neyman (751) for a 4 state model. 

The estimate of the parameters of the model, called transition rates, 
is based on observations of patients at equal intervals of time ?’, for a 
total period of time s’==T. Letting P(t’) be the matrix of transition 
probabilities for t=’, and P(t’) the maximum likelihood estimate of 
P(t’), the matrix Q of transition rates is estimated by 


Q = log P(t’) = (P(t) —1I) — (P(t) —D*/2 + (P(t) —D¥/3—: - -, 


provided the series converges. A simple condition for convergence is 
that the diagonal elements of P(t’)—T be less than 4. 
The asymptotic distributions of the elements of Q, P(t), and 
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°5 
(t) = P(u)du, the residence matrix, are all normal. The effect 
of letting ¢’—>»0, upon the asymptotic variance of the elements of Q is 
examined in a special case. When ?’ is small the formulas simplify 


considerably. 


Some tests of hypotheses concerning the equality of transition rates, 
transition probabilities, and residencies (elements of e(¢)), and some 
voodness of fit tests of the model are given. ‘The model is extended to 


two (or more) contiguous periods, each with separate transition rate 
matrices. Finally the results of a sampling experiment using a 4 state 


model are given. 
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NOTE ON ESTIMATION IN A MARKOV PROCESS 
WITH CONSTANT TRANSITION RATES * 
BY PAUL MEIER 


Department of Biostatistics, The Johns Hopkins University 


REMARKS 


N the foregoing paper Mr. Zahl presents as a model for follow-up 
| studies the general Markov process with a finite number of states, 
say S,,- - -,S,, and constant transition rates, q;;— transition rate from 
S; to S;. As he remarks, special cases have been examined before, 
notably by Fix and Neyman (’51). Indeed, perhaps the simplest case 
of this process is just the familiar exponential survivorship (or constant 
death rate) model so often used both in medical and industrial 
applications. 

In both the Fix-Neyman paper and in the foregoing excellent study, 
observation is assumed to be incomplete—the status of each subject is 
known only at certain times during the follow-up interval. Although 
such restrictions are clearly appropriate to many applications, there are 
others in which transitions from one state to another are sufficiently 
dramatic (e.g., living to dead) that observation may be regarded as 
essentially complete over the follow-up interval. Since, also, the maxi- 
mum likelihood estimates of the basic parameters, qi; are exceedingly 
simple when there is complete information, it seems appropriate to 


introduce them along with the general stochastic model. 


* Paper no. 304 from the Department of Biostatistics. 
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THE ESTIMATES 


Briefly, gi; is the constant rate of transition from state S, to state 8, 
and the maximum likelihood estimate of Jij is simply 


P Mis 
T,” 
where 
m,; = total transitions observed from S, to Sj, 
summed over all subjects, 


T; = total time under observation in state S,, 
summed over all subjects. 


This is a well known result for the special case of exponential 
survivorship mentioned earlier (Littell, 52, and Epstein and Sobel, ’53) 
in which we have one transient state (surviving) and one absorbing 
state (dead or failed). The extension of the derivation to the general 
ease is straightforward and will not be discussed here. 


AN EXAMPLE 


Consider a model for chronic disease follow-up with three states, 
S,, S., S;, representing the states “well,” “sick,” and “dead.” (S, and 
S, are “transient” states and S, is an “absorbing” state.) Suppose the 
study involves only two individuals starting in S., with the following 
histories. The first becomes well at 1.4 years, relapses at 2.3 years, 
regains health at 3.2 years and remains well for the rest of the 5-year 
follow-up interval. The second becomes well at 0.8 years, relapses at 
1.6 years, and dies at 3.0 years. The data may be tabulated as shown 
in table 1. 

TABLE 1 


CASE 1 CASE 2 
state period state period 
S, 0.0 to 1.4 S, 0.0 to 0.8 
S, 1.4 to 2.3 S, 0.8 to 1.6 
8S, 2.3 to 3.2 S, 1.6 to 3.0 
S, 3.2 to 5.0 S, 3.0 to 5.0 
Since S, is an absorbing state, m,,;==m,.=0, and T, is not needed. 


The required quantities are 





S; 
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T, = [(2.3 — 1.4) + (5.0 —3.2)] + [(1.6—0.8)] 


a= 2 7+ 0.8 = 3.5 


wi 


7, = [(1.4— 0.0) + (3.2 — 2.3)] + [(0.8 — 0.0) + (3.0 —1.6)] 


= 2.34 2.2 = 4.5 
Mi2=1+1—2 m,,;=—0+0—0 
ma B+ lS — sijge Oe? 


The estimates of the transition rates are 


a 2/35 == 0.571 jis = 0/8.1 = 0.000 
3/4.5 = 0.667 jon = 1/4.5 — 0.222. 


DISTRIBUTION OF ESTIMATES 


One’s intuition about the distribution of these estimates may be 
sharpened by examining some methods for generating a random observa- 
tion from any given process. The method indicated in Doob (753) is 
as follows. A subject starting in state S, remains there for a period 
of time randomly chosen from an exponential distribution with mean 
1/qi, (Ge = > Vos)» He is then transferred to a state S;,, chosen at 


IF to 


random from the states other than S,,, the chance of going to S; being 
Gist/Vio (J Ato). The subject then remains in state S;, a time randomly 
chosen from an exponential distribution with mean 1/q;, and then is 
transferred to a state different from S;, and so on. An equivalent method, 
suggested by the notion of competition among the states, is to choose 
i —1 times at random from exponential distributions with means 1/4q;,; 
(j%,) and make a transfer to that state, S;,, for which the chosen 
time was least, making the transition at this minimum time. (The 
needed random observations from exponential distributions can be 
generated from random digits, utilizing the fact that if z is uniformly 
distributed in the interval (0,1), —qlnz has an exponential distribu- 
tion with mean q.) 

The exact distribution of the q;; is not at all a simple matter. If 
they could be regarded as being generated by fired and predetermined 
total observation times, 7, the m;; would have independent Poisson dis- 
tributions and the (4; would have mean qj; and variance qij/T;, giving a 


: Tr ~V au/T. 1 1 
coefficient of variation of V qi/ Ts ie ee TP In fact, the 
ij Vqli VE{miy} 
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likelihood function does not distinguish between the cases of fixed and 
random 7; so that the asymptotic formula for the variance of 4, 
derived in the usual way, is just g;j/7;. Unfortunately the 7; are 
random variables, and the above results for the mean and variance are 
therefore only approximate. If, however, T; is large compared to the 
mean time required for one transfer from S; to S;, that is 1/qi;, the 
effect of its variability should be slight. Since T; must be many times 
1/qi; for qi; to have even modest precision, it seems reasonable to expect 
that the asymptotic Poisson theory will give good service in the cases 
of most interest, that is, when the data provide reasonably precise 
estimates of the ij 

Although the hypothetical study used in the example is far too small 
to provide useful estimates, it will serve to exhibit the method of calcu- 


lation of the variance estimates. Letting 6 4i;/T;, the estimated 
variance of ¢;;, we have 
0.571 163 0.000 
G12° = —— = 0.16: o13° = — == 0.000 
3.0 3.5 
0.667 0.222 
ee, = —— == ().148 G23" = ().049. 


4.5 4.5 


The estimated coefficients of variation are given by 1/\/ mj, and range 
from 58% for g., to infinity for ¢s,, showing the inadequacy of such a 


sample for estimating rates. 
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LONGEVITY OF RETIRED ARMY AND AIR FORCE 
OFFICERS * 
BY C. A. McMAHAN, JOHN K. FOLGER ** AND THOMAS R. FORD 


Air Force Personnel and Training Research Center (OERL) 


Mazwell Air Force Base, Alabama 


INTRODUCTION 


EMOGRAPHERS, actuaries, and vital statisticians have long used 
D average expectation of life as a measure of mortality and of the 
extent to which man has gained control over his biological environment. 
Accompanying the rapid industrialization of our economy and the emer- 
gence of a distinct labor force in recent years, life table principles have 
also been applied to computations of average length of working life. 
Partially as a result of the demographic, economic, and social changes 
of the last century and particularly of the last 50 years, the number of 
persons reaching retirement age has increased tremendously and there 
is now a considerable span of life for millions of persons between the 
date of retirement and death. This particular study is focused upon the 
computation of life expectancy after working life, that is, with meas- 
uring length of life in a retired status. Few life tables have been 
constructed for retired populations. 

The life table applied to retired populations presents certain prob- 
lems, both of construction and interpretation. The several values used 
in its computation can, however, be profitably utilized to: (a) predict 
future deaths in a retired population for a reasonable period of time, 


* Personal views or opinions expressed or implied in this article are not to be 
construed as necessarily carrying the official sanction of the Department of the 
Air Force or of the Air Research and Development Command. 

** Formerly a member of the staff of the Human Resources Research Institute, 
now research associate, Southern Regional Education Board, Atlanta, Georgia. 

The authors are indebted to Jerry Walker Combs, Jr., for critically reading 
the manuscript and for valuable suggestions. 
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(b) measure the man-vears of life remaining for retired individuals— 


an index of manpower potential, and (c) provide a stationary population 


for comparative purposes. The latter is perhaps the most limited in 


utility, and both (a) and (b) above must be used with due regard for 


ail 


the time reference of the life table functions and the structural pecu- 
liarities of the actual retired population. 


MATERIALS AND METHODS 


Selection of the Method Used in This Study 

The retired population which is the concern of this study was com- 
posed of 10,001 United States Army (and Air Force) officers living in 
retirement between January 1, 1925 and December 31, 1948. From this 
group, three were omitted for lack of complete information, leaving 
9,998 for analysis. Of the latter, 1,906 were living in retirement at 
the beginning of the period, and 8,092 retired on or after January 1, 
1925. During the 24-year period there were 2,795 deaths, 1,509 of 
which occurred among officers retiring during the period. Just above 
7,000 (7,203) were therefore living in retirement on December 31, 1948. 

The primary consideration for the selection of the methodology used 
was that the population was small and changing rapidly in both size 
and composition. Because of the small number and limited period of 
time covered, a cohort life table could not be constructed (Jaffe, 751, 
3). Such a table would at any rate have had the added disadvantage 
of the varying ages at which individuals entered the population and 
the purely historical interest of the mortality experience reflected.1.| On 
the other hand, the retired population at any one time was too small 
to allow the construction of a current life table by the usual method 
of using a mid-point population and a three-year average of deaths. 

The method selected may perhaps he classified as a current multiple 
decrement table, but certain departures in selection of population and 
use of deaths have been made (Dublin et al., ’49, pp. 279-280 and U. S. 
Dept. of Labor, *50). Instead of being based on a specified calendar 
year, the rates computed are for a hypothetical “average” year in this 
period. Each of the retired officers has been counted in the population 


at each year of age or portion of year during which his life experience 


1 Observed survival ratios have ber n comy uted. however, for those who were 


in a retired status when the study began as well as for those who retired during 


the period 1925 1929. 





ct US 


LONGEVITY OF RETIRED OFFICERS 127 


could be observed. Thus, most individuals entered into several age 
groups as a result of aging during the 24-year span. Conceivably an 
officer could have entered into the base population of 24 different age 
groups. In actuality, the average number was 7 or 8.2. Thus the popu- 
lation for any year of age consisted of all oflicers in retirement who 
attained that age during the period and all officers who retired at that 
age. The 50-year age group, for example, consisted of all officers who 
retired at age 50 or attained age 50 in retirement during the 24-year 
period. The 51-year age group consisted of all officers who survived 
their fiftieth year, plus those who retired at age 51. In general, since 
additional retirements occurred as age increased, and because officers 
retiring at younger ages grew older during the period, the population 
used for computing age-specific rates generally increased up to age 64. 
It was only after that age that attrition through death outweighed the 
increments to the population. 

Obviously, it was not possible to treat the population arrived at as 
an enumerated population. It was possible, however, to follow two 
courses. The population could have been computed on the basis of the 
total number attaining to each x birthday in retirement, making possible 
the direct computation of the probability of dying; or it was possible 
to convert the population into years lived in retirement at age x, making 
possible the computation of the death rate. Because the former would 
require sacrificing certain data and would involve complications in 
making the computations by 5-year age groups, and because of certain 
advantages in age-specific death rates themselves, it was decided to 
compute the death rate and to deal with the population on a years-lived 
(years of exposure) basis. 

Accordingly, the ,m, (a convenient summary of the conventional 
notation used can be found in Reed and Merrell, 39) for any given age 
zr was computed from a rather than from the more usual es The 

n= @ n 
»D, included all of the deaths for an age group occurring during the 
24-year span, the ,’, representing the years lived (observed) in retire- 


? Computed by dividing total man-years lived during the period by the maxi- 
mum population, 9,998. This gives the average years lived in retirement. 

* Deaths in the age group « to r7+n=,D,; population in the age group # 
to #+n=,P,. For convenience, the observed person-years lived at each age 
has been denoted by ,Z’,. Thus the death rate ,m, really amounts to deaths per 


1,000 years of exposure. 
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ment in the interval z to r+ n. It is immediately apparent that the 
m D 


formula yllty is roughly equivalent to deriving the nits from 


= 
gr 


the ~ values of a life table fs - deaths in the life table age group 


om 
ztox-+n; ,l,—life table population in the age group z to r+ -n). 
Theoretically it may be considered a more accurate value for ,m, than 
that usually employed, but it is perhaps influenced at ages up to 65 by 
the fact that increments to the population occurred during a year of 
life, so that both the numerator and denominator are expanding values 
during a year of life. The theoretical consequences of this fact have 
not been thoroughly explored. They undoubtedly have some bearing 
on the convertibility of the ,m, values to ,g, (probability of dying) 
values by the Reed and Merrell method. 

The methods used in actual computations were those adaptable to 
(1) data available and (2) IBM tabulation.* 


Avatlable Data 
The following data were available for use in the study of length of 
life of retired officers (Official Registers, *25-’49) : 


Recorded data. (1) Date of birth of each officer, (2) date of retire- 
ment of each officer, (3) date of death of each officer who died, (4) date 
of beginning of study (January 1, 1925), (5) date of termination of 
study (December 31, 1948). 


Computed data. (1) Age at retirement of each officer, (2) age as 
of December 31, 1924 for those retiring previous to January 1, 1925, 
(3) age at death of each officer who died, (4) age as of December 31, 
1948 for those officers alive on that date. (All ages were computed in 
years to two decimal places.) The various possibilities of contributions 
to the years lived in retirement were listed and are shown in figure 1. 


Years Lived in Retirement (Years of Exposure) 
The years lived in retirement were computed by the method illus- 
trated in table 1; this method allowed a detailed check of the data at a 


“The authors are indebted to Stephen J. Zolezynski for furnishing the IBM 
methodology; they are indebted to Stephen W. Fotis for providing a manual 


verification on a sample basis of the method used. 
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glance and yet the machine process was simple.® Figure 2 shows the 


“population” (years of exposure) obtained. 


A ge Specific Death Rates 


Having secured the population it was only necessary to compute 
— > 
nm, by the formula , as previous 


, 


ly given, ,L’, denoting the number 
n= 2 


of years lived in the interval z to «+n. As stated previously, an 


D mF 2 | 


approximate equival nce between nas been assumed, 


n n+ 
although each implies a different type of base population. The ym, 


values for the entire retired population, compared with those of white 


As the first step in the computation process (IBM), the population was 


ordered by age at retirement. The next step was to cumulate for each age the 
number of officers who lived in retirement at that age Each person who retired 


counted as contributing a year at the age of retirement and each succeeding 


vear through age 103 Ave 103 was taken as the final age since no officer in 
the population lived beyond that age during the period observed The maximum 


population of 9,998 was reached at age 94 when the last person had retired.) 
This gave the total number of retirement years that would have been contributed 
at each age if (a) a person contributed a full year at the age which he entered 
the population and (b) no persons left the population. Since these two conditions 


did not hold, appropriate corrections had to be made. To correct for the fact 
that few or no officers actually contributed a full year in the year of entry, the 
ictual proportion of the year which they did contribute was computed The 
machine technique selected for doing this was to compute and tabulate the pr 


portion of the year which was not contributed. Then by a clerical operation this 


amount was subtracted from the full year which had been credited at age of entry. 


4 similar technique was used for subtracting dropouts Since an officer was 
credited with a full year for every age after he had entered the population, it was 
necessary » subtract a full year for every age after he had died and for all ages 
which would have been attained after December 31, 1948. This was a matter of 

imulating the number of dropouts from age 23 onward and subtracting the 
cumulative total at each age. For the age at which an individual was lost to 
the population, however, a portion of the year was actually lived in retirement 


Since he was already credited with contributing a full year at that age the 
portion of that year not lived in retirement was subtracted as a correction factor 

In summary, the method was to reckon the maximum retirement years that 
an officer « Id contribute and then to subtract all the full years and fractions 
of years that he was known not to have contributed. Two IBM machine runs 


only were required, the final number of years lived at each age being obtained 


by simple subtraction 
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males in the United States 1939-1941, are shown in figure 3. The ,m, 
values computed for specified portions® of the retired population are 
given in figure 4. 

Smoothing the data. The smooth curves of mortality applying to 
retired officeis shown in figures 3 and 4 are, of course, not the actual 
observed rates obtained. Despite the inflation of the population through 
computing years of exposure, the population at all ages was still 
extremely small. A total of 10,001 individual officers constituted the 
initial population, of whom three had to be eliminated because of lack 
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Fic. 4. AGe-Speciric DEATH RATES OF SUB-GROUPS OF THE RETIRED OFFICER 
POPULATION OF THE UNITED STATES ARMY AND AIR ForRCE, 1925-1948. 


of data. Thus 9,998 officers lived in a retired status averaging less 
than 8 years each. The total population (person-years lived) at all 
ages was therefore less than 80,000, actually 76,172.81. Against these 
there were only 2,795 deaths. The resulting mortality rates were 
therefore highly irregular. On the usual assumption that these irregu- 


9? 


larities are not a feature of the “true” mortality rate, they have been 
graduated mechanically. In this case it appeared that any mathematical 
formula would be as arbitrary a method of smoothing as drawing the 
eurve with the aid of a spline. 


The resultant curves were tested for smoothness by the method of 


*Computations for these sub-groups were made in the manner previously 
described by means of additional “ cut-on” and “ cut-off” dates. 
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third differences. The results compared favorably with Greville’s test 
of white males for 1939-1941 (Greville, ’46, pp. 126-129). In addition, 
the graduated rates were applied to the actual years of exposure and pro- 
duced the results shown in table 2. These were considered satisfactory. 


TABLE 2 
Actual Deaths Compared to Number of Deaths Produced by Smoothed Data for 
Selected Age Groups of Retired Officers of the United States Army 


and Air Force, 1925-1948 


AGE GROUP OBSERVED DEATHS SMOOTHED DATA 
30-39 67 71 
40-49 230 210 
50—59 549 565 
60—69 915 924 
70-79 685 685 
80-89 288 289 


Total 30-89 2734 2739 





Probability of Dying 
Conversion of the computed ,m, values to »gz by means of the 
2m, P 
formula q, = =——— was explored but was discarded in favor of the 
e+- My, 


widely-used Reed and Merrell tables (’39). One possible limitation 
» 
L 


should be recognized here. Since the Reed and Merrell tables give the 
D d 
° P > n I ° ° ° 
relationship of ,m,—= = to nf, ~ it has been assumed in this 
als ale 
: m: ' 
study that the same relationship holds when ,m, = i . Some question 


regarding the equivalence of the two formulas for ,m, has already been 

raised above, but the Reed and Merrell tables appear more satisfactory 
2Me 

9 } 


a My, 


for all ranges of ages than the formula 


RESULTS 


The findings of the present study are presented in figures 3 and 4, 
and in table 3. For the retired population, age-specific rates were 
higher at ages under 65 than were rates for corresponding ages among 
the white male population in 1939-1941. Above age 65, however, rates 
were lower for the retired officer population (figure 3). The higher 
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mortality for retired officers at younger ages probably reflects (1) the 
earlier base period for the retired officer rates and (2) the “reasonably 
expected ” lesser longevity of officers retiring at younger ages (Reed and 
Love, 31). How much of the difference can be attributed to the latter 
is not known, but there seems to be rather clear evidence that the time 
reference has influenced the difference. In figure 4 it can be observed 
that mortality rates are lower for officers retiring recently than for 
those retiring earlier. The factors effecting an increasing longevity for 
the general population in recent years thus appear also to be operative 


among this retired population. 


DISCUSSION 


When the age-specific death rates are carried forward in the life 
table to statements of life expectancy, a certain fiction is inevitable. 
All life tables incorporate such a fiction, since the stationary population 
is never identical with the actual population and the mortality rates 
are rarely applicable over the entire lifetime of any actual cohort. The 
present life table population, however, departs from the actual in rather 
extreme degree. To what extent this departure results in unreasonable 
life expectancies at the younger ages, however, is not known. The life 
expectancy of persons retiring at age 25-29 and earlier has been deter- 
mined not only by their experience, but by those retiring at older ages. 
If those retiring at younger ages enjoy materially less longevity than 
those who retire at older ages, then the life expectancy value for age 


bably misrepresents what could be expected from limiting 


25—29 pri 
analysis to this group alone.’ In projections of total years lived in 
retirement the values should, of course, be obtained from actual data 
on persons who retire at such young ages. The life table population 
is manifestly not appropriate for such purposes. Useful projections 
may be made, however, by relating the average life expectancy at each 
age to the number of officers who retire at those ages. As long as the 
life table values may be considered as approximately applicable to a 
A 


retired population, estimates of cost or of total years lived in retire- 


ment by a retired population may be obtained. 


"It has been possible to observe 8 individuals retiring at ages 25-29 for a 


period of 20 years. At the end of that time, 6 were still alive, two had died. 


aA 
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CONCLUSIONS 

The technique of life table construction utilized in this study of 
retired officers might be profitably applied to many groups. For example, 
in the face of the current teacher shortage, a life table could aid in 
determining the additional personnel that would be available if the legal 
age of retirement were delayed five years. It should be realized however 
that this particular method of life table construction has both advan- 
tages and disadvantages. Its chief advantages are as follows: (1) It is 
an application of a method of life table construction to a relatively small 
and changing population that can be observed over a period of years 
when no cohort for a single year is large enough to warrant the con- 
struction of a conventional current table or can be observed long enough 
to make the construction of a cohort life table feasible. (2) It utilizes 
an uncomplicated machine technique which greatly simplifies the com- 
putational procedures. Its limitations, however, must be admitted. The 
mortality experience is not so recent as that used in strictly current life 
tables, but is less obsolescent than that used in cohort life tables. The 
farther the mid-point of the period from the current date, the less 
predictive value the life table will likely have. Finally, retired popu- 
lations of the present type have a greater range of ages at which indi- 
viduals enter the population than do most populations for which life 
tables have been constructed. Retired officer populations may be more 
extreme in this regard than other retired populations. Wherever similar 
populations are encountered, however, average life expectancy values 
must be interpreted with care. 
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THE HEIGHTS AND WEIGHTS OF MEN ACCORDING 
TO MONTH OF BIRTH 


By A. B. PIET 


Auckland University College 


T the time my book on seasonal fluctuations ( Fitt, °41) was written, 
A no data had been forthcoming to test the relationship of height 
and weight of adults to month of birth. Many studies were available 
of children of different ages. At about the same time two articles 
appeared offering measurements of American university and college 
students in a similar kind of analysis. One was by Mills (41) and the 
other by Chenoweth and Canning (°41). The trends noted were in 
many respects similar to those of the author’s analysis but not identical. 
It must be noted that college students represent a highly selected 
population, not calculated to reveal the growth condition of the popu- 
lation as a whole. In any case they do not represent full adult growth. 
The height and weight records were taken for 16-, 17-, 18- and 19-year- 
old freshmen who were still growing (and at different rates). 

In order to secure data for a fully adult population it was decided 
to analyse the heights and weights of a large sample of the men of the 
New Zealand army draft of World War II. Access was given for this 
purpose to the Army Base Records, Wellington. To secure as perfect 
a sample as possible, names were drawn from files classified under each 


letter of the alphabet and in number proportional to the total number 


of entries under each letter. The names drawn from each letter-group 
(which followed a complete alphabetical sequence) were the ones, up 
to the number required, which came first. [It would appear that in 
this way an almost perfectly random sample of the male population was 
secured. Height measurements of 21,342 and weight measurements of 
21,177 men were thus available. 

The analysis of the height measurements when arranged according 


to the months of birth follows in table 1 which gives the means in inches 
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along with their probable errors and the number of cases. A more 
detailed analysis is presented in table 2, a frequency table giving a break- 
down by months. The original calculations were based on half-inch 
intervals but for reasons of space inch intervals were used here. In 
following the months it must be remembered that the data are for the 
Southern Hemisphere and that the seasons are therefore reversd. 


TABLE 1 


Heights by Birth-Months 


Jan. Feb. Mar. ‘Apr. May June July Aug. Sep. Oct. Nov. Dee. 





Mean 
(inches) 68.19 68.25 68.09 68.06 68.10 67.94 68.16 68.18 68.24 68.16 68.24 68.17 


P.E.m .0409 .0415 .0392 .0389 .0406 .0390 .0392 .0414 .0391 .0406 .0423 .0400 
number 1752 1634 1849 1763 1802 1799 1832 1720 1886 1861 1659 1785 





























HEIGHT WEIGHT 
in. Height Ibs. 
68.15 150.40 
68.10 150.20 
68.05 150.00 
67.95 q f — 149.60 
yy pry yy | | 
67.90 I 149.40 
Jan. Mar. May July Sept. Nov. 
Feb. Apr. June Aug. Oct. Dec. 


Fig. 1. HEIGHT AND WEIGHT BY BIRTH MONTHS. 


Table 1 shows the tallest men were born in February and the shortest 
in June. Those of September and November birth are just shorter than 
those born in February. The differences between the June mean and 
those of these high-level months are significant. Some of the other 


es 
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high means when compared with the June low-level provide fairly signifi- 

cant differences. The mean difference between the tallest group and 

the shortest is just under a third of an inch (.31 inch). These monthly 
TABLE 2 


Distribution of Heights by Birth-Months 


Inches Jan. Feb. Mar. Apr. May June July Aug. Sep. Oct. Nov. Dee. 


50 Plus ] 

= 0 

52 0 

53 0 

54 * l 0 

55 “* 0 0 

56 “ 0 l l 
oT l 0 l 0 l 0 
. iw l 2 l 0 2 0 0 0 
a9 l ] 0 0 l 0 2 2 2 4 0 
60 4 5 4 4 + 4 6 2 4 7) 2 l 
61 11 4 5 6 8 2 ) 5 8 6 5 12 
62 27 20 21 18 20 35 0) lé 2 29 20 17 
63 i8 4 64 46 54 6 15 6 165 3 $5 0 
64 92 68 93 96 111 113 98 98 87 105 83 99 
65 152 143 #182 #171 158 156 139 161 16] 153) «139 )~—S 178 
66 218 218 229 259 230 273 251 245 250 =—236 206 19 
67 251 263 315 252 281 262 06 248 294 277 254 244 
68 306 233 266 265 76 307 93 267 297 9G 272 229 
69 250 226 251 25! 229 226 = 257 246 244 236 225 253 
70 171 190 191 178 184 169 181 151 215 193 172 186 
71 109 «108 = «=117 99 130 100 4112 #2116 «2118 27 «610 110 
72 59 69 62 60 68 y 64 73 75 8] 3 78 
73 43 24 28 33 25 23 26 32 41 $2 47 32 
74 12 15 12 10 14 12 12 13 15 10 25 13 
75 4 4 6 3 0 8 6 2 ) 7 2 
7 3 ] 3 2 3 ] 4 2 ] l 
77 0 ] 0 0 l 

78 l l l 


fluctuations are illustrated in figure 1. The main trends of this analvsis 


correspond to those ¢ 


f studies reported by the author in 1941 with 
reference to the heights of children. 


The corresponding facts for weights according to month of birth are 


given in tables 3 and 4 and figure 1. The distributions in table 4 work 
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TABLE 3 
Weights by Birth-Months 


Jan Feb. Mar. Apr. May June July Aug. Sep. Oct. Nov. Dec. 


Mean 
(Ibs) 150.41 150.11 149.81 149.78 149.48 149.44 149.91 150.41 150.57 149.67 150.19 150.76 
P.E.m $182 .3275 .3061 .3163 .3011 .3066 .3216 .8178 .2946 .3045 .3174 .3155 


number 1758 1583 1822 1739 1799 1799 1825 1724 1849 1860 1688 1781 


TABLE 4 
Distribution of Weights by Birth-Months 


Pounds Jan. Feb. Mar. Apr. May June July Aug. Sep. Oct. Nov. Dee. 


86 Plus l ] l 
— 4 ] 3 ] 3 6 0 3 2 2 2 
ie. “ 8 4 1] 14 4 5 12 8 9 14 5 8 
110 * 40 27 4 3 44 0 26 39 34 45 28 34 
118 “ 78 77 8? 88 94 90 87 82 67 85 87 95 
6. * 216 208 254 232 248 227 256 204 223 245 211 220 
ie = 304 280 323 283 270 307 283 286 282 278 262 276 
i<Z ™ 300 27 312 315 33] 304 328 312 390 388 289 332 
150 “ 270 261 266 292 306 310 313 262 297 273 271 «291 
is8 “ 201 164 224 194 184 189 201 201 219 206 178 175 
166“ 15] 123 133 115 144 137 143 140 140 122 125 129 
74 “ 81 59 61 59 66 67 56 75 67 76 77 98 
182 4) 9 )] {4 50 52 49 52 52 60 52 56 
190 27 23 29 3] 24 26 27 28 24 28 19 27 
198 12 10 22 18 22 1] 12 12 17 14 17 19 
206 10 7 14 10 4 10 12 11 12 15 6 7 
214 6 6 3 2 4 5 7 7 6 6 6 5 
222 6 4 0 : ] 3 ] ] 5 ] ] 3 
230 ] 2 ] 4 ] 1 3 2 2 ] 0 2 
238 ] ] 0 2 0 ] 2 0 ] 2 

246 0 l 0 0 ] 


406 “ l 
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with 8-pound intervals instead of the original 4-pound intervals. The 


lightest men are born in June and the heaviest those born in December 
1.32 pounds. The June measure is part of an 


t 
I 


with a difference of 
extended low-level period and the December one similarly of a high- 


level period. 
f 1 the others are not 


The differences between the lowest weight an 


highly significant. This is due partly at least to the skewing in the 
weight-distributions well indicated by table 4, a fact that is related to 
the tendency of the older men toward excessive weight. This feature 


in adult weight was noted by Kendall (43). 


SUMMARY 


r 21,000 men of the New Zealand army draft for World War II. 


; . . : ’ } , 
.. \n ha Sis Was mace ot the height and weight measurements 


2. The tallest men were those born in February (summer) and the 
shortest those born in June (winter). There was an average difference 

h iest men were those of December birth and the lightest 
those born in June. There was an average difference of about one and 
one-third pounds between these weight-levels. 


t. The main differences in height were significant and those in the 
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Standard Values in Nutrition and Metabolism. Edited by Errert C. 
ALBRITTON. Pages xiii+ 380, W. B. Saunders Co., Philadelphia, 


1954. $6.50. 


In view of the rapid, almost terrifying rate of accumulation of 
scientific data, largely quantitative in nature, a periodical critical sifting 
and compilation of reported results is an important supplement of 
research productive of new facts. Consequently, the idea of a compre- 
hensive handbook of biological data has a real appeal. 

This ambitious project, receiving substantial federal financial support, 
was entrusted to a special committee, as an organ of the American 
Institute of Biological Sciences, affiliated with the National Research 
Council. In addition to the volume on blood (published in 1952) and 
the present fascicle, it is planned to cover in separate volumes the fields 
of growth and reproduction, animal and plant physiology, biochemical 
composition, and toxicology. There is to be, also, an abridged handbook, 
covering all the areas. 

The current volume consists of 233 pages of tables, 16 pages of 
diagrams, some 100 pages of bibliography indicating the source of the 
tabular entries, and an index. The readers of this journal are likely to 
be particularly interested in data on man’s daily nutrient allowances 
(bringing out the unresolved differences between the American, Canadian 
and British figures) ; functions of chemical elements and vitamins, with 
signs of deficiency and excess; excretion of the end products of meta- 
bolism ; and basal metabolism, including the presently proposed standards 
and a comparison with other norms. The standards commonly used 
heretofore in America were based on measurements made on subjects 
unaccustomed to the procedure and were higher, while the British 
standards were based on the lowest of repeated measurements on trained 
persons under rigorously basal conditions. As example, the earlier 
American (Boothby, Berkson, and Dunn, 1936), current American 
(Boothby, 1952), and the British (Robertson and Reid, 1952) standards 
for 25-year-old men are 40.3, 38.4 and 37.1 Cal/sqm/hr, respectively. 

It is hoped that anthropometric data, important for the characteri- 
zation of man’s nutriture, will be included in one of the subsequent 
volumes. 

JOSEF Brozek 
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Coronary Heart Disease in Young Adulls. By MeNARD M. GERTLER, 
Paut D. Wuirr, AND OTHERS. Pages xvili + 218, Harvard Uni- 


versity Press, Cambridge, Massachusetts, 1954. $5.00. 


Sanitation and the antibiotics are so successfully solving the health 
problems caused by other organisms that soon only the faults of the 
human organism itself may remain to plague us. That these, however, 
will be more than enough to engage all our research energy is obvious 
from a cursory glance at our vital statistics. In spite of great advances 
in diagnosis, surgery and medical care, it is obvious that there is as 
yet little progress towards the control and prevention of our self- 
generating ills that now dominate the health picture in this country. 

From the standpoint of mortality, the outstanding problem is now 
coronary heart disease, which bids fair to account for half the deaths in 
the near future. The appalling frequency of coronary heart disease, how- 
ever, is perhaps less significant than the fact that it attacks and kills 
some OI us at an early age while some of us seem to be immune. Of 
course, we want to know why, but the first step towards answering this 
must be to discover who. In other words, how do the patients with 
coronary heart disease differ from their more fortunate fellows ? 

Fifteen years ago Dr. Paul D. White began to attack this question 
by comparing patients with clinically healthy controls, and the present 
volume is a report on a continuation of this inquiry with a hundred 
young (under 40) coronary patients and a suitable control series. 


In many characteristics, including relative body weight, the coronary 


patients were not peculiar, but several significant differences were found. 


The coronary patients tended to be “stocky” or “ mesomorphic,” to 
have relatively high concentrations of cholesterol and of urie acid. in 
the blood, and to produce saliva of rather special characteristics. These 
peculiarities are quite non-specific, but they are statistically significant. 
Much has been made in the past of the fact that men are more prone 
than women to early coronary disease. In the present study, Gertler, 


White and their colleagues used various methods to explore the question 
of a masculinity factor, but the results are inconclusive. They found no 
difference between the patients and the controls in the excretion of 
male sex hormones. In physique, the patients tended to be more masculine 
than the controls but psychologically they were judged to be more 


feminine. 


A dietary inquiry was also inconclusive except that it was clear that 
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the coronary patients did not consume large amounts of cholesterol nor 
eat a very heavy diet at the time they were studied. However, this was 
the situation after the development of coronary disease and it would 
be more useful to know their lifelong dietary pattern before the develop- 
ment of the disease and all the resulting changes in the mode of life. 

The work by Gertler, White and their colleagues was frankly explora- 
tory. This clear presentation of their findings will help point the way 
to profitable new studies. By the same token their results tend to 
discount the probable value of further studies with some of the methods 
they tried. In sum, this book gives emphasis to the great need for larger 
and more systematic efforts to discover the characteristics of body and 
of mode of life associated with the development of atherosclerosis and 
coronary heart disease. It is probable that there is no single cause but 
only influences to be found and that all of us are susceptible in some 
degree. To discover the effective interplay of nature versus nurture will 
require attack from many approaches and with many methods. Future 
multidisciplinary research will build on these studies. 

ANCEL KEys 


1951 Impairment Study. By the Sociery or ActTuarres. Pages 300, 
Society of Actuaries, New York, 1954. $7.50. 


For some fifty years, the Society of Actuaries has been publishing 
in irregular intervals the results of medical impairment studies conducted 
upon material contributed by American insurance companies. The last 
major study concerned itself with a twenty-year period ending in 1928. 
Minor studies were conducted in 1936 and 1938. The present investiga- 
tion was based upon the experience of 1935-1949 upon the lives of 
applicants who were medically examined and applied for ordinary insur- 
ance policies ; 27 life insurance companies representing 70% of ordinary 
insurance in force in the United States and Canada agreed to take 
part in this investigation. 

The purpose of the study was to make recent mortality experience 
available for analysis of the effects of specified “ impairments ” (such 
as overweight, family history, heart murmurs and other circulatory 
conditions) on death rates. The practical end was to provide a basis 
for adjustments in insurance premiums (extra premiums) to be paid by 


individuals with less favorable life expectancy. 
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As a control and a reference point, a basic mortality table was con- 
structed reflecting the death rate for the corresponding period among 
those applicants who were considered unimpaired. Since in previous 
studies other basic mortality tables have been used, a comparison of the 
effects of a particular impairment in two or more periods is difficult. 
For example, if an impairment would show 150% of standard mortality 
currently and was rated the same in a previous study, it would not 
necessarily indicate that this impairment has produced comparable mor- 
tality in the two periods. It would merely mean that it produced 50% 
more mortality than the group which, in the corresponding period, was 
considered standard by the underwriters. 

In interpreting the findings for purposes other than those for which 


ry 
they were originally designed, the limitations of the data must be 


considered. The statistics are derived from a selected sample drawn 
5 i 


from among white male adults in the middle income group. It is no 


necessarily representative of the population as a whole. Mortality ratios 


were computed on the basis of the number of policies and not on the 
number of lives. That means that if a person held several policies, 
he appears statistically like several exposures, and if such person d ed, 
several deaths were recorded. While the control group was selected on 


the basis of a medical examination, the standards were probably not the 
same as a researcher would wish to apply in selecting a group of clinically 
ring 


healthy persons. The study did not concern itself with the uncovering 


of more basic associations but mainly with impairment classifications 


and indicators, some of which do not represent any specific clinical 


diagnosis but only reports of physical findings or laboratory analysis. 


All of this does not mean that t] g 


1e study is not highly significant for 


if 


the purposes for which it was conducted but that in its prese 


is not particularly useful for the practicing physician or the medical 


nt form 1 


investigator. This is not the fault of the authors. who have fulfilled 
their assignment rather well. It is the result of an unfortunate lac 


a 4 


‘ ea 
of agreement upon standards between the clinician, the researcher, and 


the interpreter of insurance medical findings, and of the practical limi- 
tations of medical examinations conducted in connection with the issuance 
of life insurance policies. In some instances. onlv very recent re search 
work, conducted outside the framewerk of actuarial mortalit Invest 

gations, has pointed to significant indicators of “ impairment.” in the 
sense of predictors of unfavorable health or longevity (cf.. smoking or, 


apparently, high fat intake and elevated level of blood cholesterol). 
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In the impairment studies of the Society of Actuaries, there is much 
material remaining dormant which is important to human biology. 
Additional criteria of impairment may usefully be added. The insurance 
industry has a unique opportunity to aid research because it is able 
to observe large groups of initially clinically healthy persons over long 
periods of time and evaluate the role of individual differences in “ con- 
stitution” (with its anthropometric, biochemical, physiological, and 
psychological aspects) and in the mode of life. 


E. H. KiLeperar 


America’s Resources of Specialized Talent. The Report of the Commis- 
sion on Human Resources and Advanced Training, Prepared by 
DareL WoLFLE, Pages xviii +- 332, Harper & Bros., New York, 1954, 
$4.00. 


This book had its beginnings in the concern of the National Research 
Council, the American Council of Learned Societies, the American 
Council on Education, and the Social Science Research Council with the 
relation between the need for and the availability of highly trained 
specialists in many areas affecting national welfare and security. The 
report is the outgrowth of studies that began in 1949 with the appoint- 
ment of the Commission, with Dael Wolfie as director of research. The 
work is premised on the assumption that this country’s place in world 
affairs for years to come will depend upon effective utilization of its 
intellectual resources, and this leads directly into a series of fact-packed 


chapters that contribute answers, in some measure, to such questions as: 


What are the prospects for effective utilization of specialized talent? 
What are the future needs for it? What is the potential supply? How 
adequate are we utilizing these potentialities ? 

The report begins with a detailed classification of the fields of 
specialization. In general the Commission confines itself to those fields 
for which four or more years of college training are required. This 
leads into a comprehensive analysis of trends in college graduation by 
specialized fields, at both the bachelor’s and the Ph. D. level. Another 
basic chapter is devoted to the occupational distribution of college gradu- 
ates, with particular interest centering in the relationship between the 
field of college specialty and actual employment. This poses a funda- 
mental question: To what extent is there flexibility in the utilization 
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of trained manpower? The relationship between degree of specialization 
and ultimate use of trained ability has significant social implications. 
The relation between supply and demand in the specialized fields is 
carefully analyzed, and as in all of the chapters, the assumptions that 
underlie the handling of the basic data are carefully stated. By general 


= , 
fields, present 


size of the professionally employed groups, estimates of 
current demand, and supply-demand prospects are set forth. 

The logical question then follows: How much scientific talent is 
? There 


not being utilized; how much potential ability is being wasted 
is elaborate consideration of who goes to college and the ability-levels 
of the college population. The variations in intellectual ability among 
the specialized fields, by educational level, are striking evidence of a 
selective process that is going on. 

Finally, the report discusses the utilization of the educated specialist 
and considers means for improving utilization of the potential supply. 

Certain broad conclusions to which the Commission comes can be 
suggested here: the trend toward further specialization is unquestionable, 
and no evidence exists that a saturation point has been or is about to 
be reached. Because of the close relationship between college graduation 
and specialization, the future number of such graduates (now about 
270,000; in 1970 to be about 590,000) is the key to future supply. 
Today, known shortages of specialists exist, and some degree of shortage 
ean be expected to exist for many years, at least in some fields. While 
college graduates tend to remain in their field of original training, 
a major 


se 


there is a large amount of shifting, and this flexibility is 
national asset,” since it contributes to more effective utilization of skilled 
manpower. 

It has further to be concluded, the report demonstrates, that much 
rh talent is being wasted. Colleges could be double in 
ith no lowering in quality. It is not that the college 


population is too big; rather, that a significant percentage of high ability 


youth do not v0 TO college. Fields of specialization do select on an 
ability basis: students in physical sciences, business and home economics. 
These differences, overlapping as they are, are less marked than the 
differences between college and non-college groups. 

The Commission believes there is waste in utilizing available trained 


ability and that there are steps that could be taken to improve this 


situation. There is need, Too, for more attention to identifying, 


encouraging and even subsidizing the potentially able students so that 
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they will receive the specialized training that existing and future needs 
call for. 

No review as brief as this can do more than suggest the wealth of 
material assembled in this volume, and the ingenuity and care with 
which it has been drawn together and analyzed. Certainly the book 
does not answer all the questions that it raises, but it provides the best 
available starting point for the further investigations that should follow. 
It is a stimulating volume, not only because of the information it 
provides about existing human resources, field by field, but even more 
because of its broad implications that fundamentally touch national 
policy, and through policy, future national security. 


Matcotm M. WILLEY 


Human Limbs and their Substitutes. Prepared under the sponsorship 
of the Advisory Committee on Artificial Limbs, National Research 
Council. Edited by Paut E. Kiopsree and Pxitie D. WILson, 
Pages 714, McGraw, Hill, New York, 1954, $12.00. 


Three facts stand out as one attempts to assess the worth and 
significance of this large volume: 1) the range and diversity of the 
fields from which the contributors have been drawn; 2) the monumental 
nature of the work upon a subject which has previously had virtually 
no modern literature; and 3) the testimony it bears to the admirable 
coherence of a program which has been centralized about a pressing 
human problem but which requires for its solution the concerted action 
of diverse specialists. 

The variety of contributing fields to the practice of prosthetics 
(limb fitting) is explained when it is realized that the stump is produced 
by the medical man, the artificial limb is fabricated by a paramedical 
specialist, the prosthetist, using mechanical components which are devised 
and manufactured by engineering and industry, and the amputee is 
aided in his biomechanical adjustment to the limb by the physical or 
occupational therapist, and in his emotional readjustment by the psychol- 
ogist. Accordingly, there are chapters dealing with surgery, such as 
amputation, amputee pain and cineplasty; with prosthetics, such as 
prescription and limb fabrication; and with rehabilitation, such as 
training, therapy and psychological treatment. 




















150 BOOK REVIEWS 


Since it is also a report on recent research and development, the 
book contains much new and unique material in both the biological and 
technical categories. Exhaustive locomotion and manipulation studies 
are reported, and the outstanding developments in replacement mecha- 
nisms are described. These applications provide many significant bio- 
technical lessons. The excellent amputee pain studies represent advances 
in both theory and medical practice. The development of the cosmetic 
glove itself is a story of achievement in applied high polymer chemistry. 

In summary, Human Limbs is an essential reference work for all 
those who work directly with amputees—surgeons, prosthetists, and 
therapists. But for many others it provides excellent source material on 
body mechanics, specialized engineering mechanisms and materials, and 
rehabilitation concepts and methods. 

Craig L. TAYLOR 


A Methodological, Psychiatric and Statistical Study of a Large Swedish 
Rural Population. By TaGe Larsson and TorstEN SJOGREN. Acta 
Psychiatrica et Neurologica Scandinavica, Suppl. 89, Pages 250, 


Ejnar Munksgaard, Copenhagen, 1954, 25 Swedish krona. 


This is the first volume of the newest of the Scandinavian population 
studies. It describes, for a change, not an isolate but a district selected 
as average and representative of Swedish rural population, thus providing 
a psychiatric base line for the whole of the country. 

The ambition of the study, as seen in the first volume just published, 
has been the fullest feasibile ascertainment of the incidence of mental 
illness. The completeness of ascertainment is of utmost importance 
because the first step in genetic investigation of mental disorders is a 
comparison of familial concentration with incidence in the general 
population. This task has been completed satisfactorily; the study has 
furnished another set of reliable basic data, and this is its worth. 

The authors have investigated the incidence of psychiatric disorders 
and of low grade mental deficiency in an area consisting of 13 parishes 
comprising two large and a number of small islands off the west coast 
of Sweden, close to the mainland. The area had a population of about 
25,000 and there were some 1300 registered cases of major mental 
disorder. In order to avoid the well known but rarely heeded deficiencies 


of ascertainment inherent in the cross-sectional census method, the 





BOOK REVIEWS 151 


authors scrutinized their population through a 45-year period of observa- 
tion. Their technique had been seasoned by previous Scandinavian field 
investigations. Most, if not all, of those investigations had been planned 
as a technical improvement on the population sampling methods initiated 
by the Munich school of psychiatric genetics. Judged from this view- 
point, this monograph represents a technical peak in the line of some 
four decades of endeavor to master the problems of sampling for the 
biological study of mental disorder. 

The usability of their “ period method” is largely limited to Sweden 
with her enviable system of registration. The selected period of observa- 
tion, 1900-1944, covers the whole of development both of modern 
psychiatry and genetics. Certainly, the attitude toward mental illness 
was not the same at the turn of the century as it is at the present time. 
Clinical opinion, as well as diagnostic methods, too, underwent radical 
changes. Thus some uncertainty as to completeness and uniformity of 
ascertainment would be anticipated. The authors themselves acknowledge 
the shortcomings of this longitudinal observation by stating that the 
most representative figures were obtained from the period of 1920- 
1940. Aside from the uncertainties arising from the progress of time 
one may anticipate diagnostic imperfections inherent in any study 
depending for basic information on perusal of hospital records and field 
investigation. The authors are aware of that. One wonders if the labors 
and good will invested into such extensive field study could not be 
utilized in a more permanent manner, e. g., by setting up a permanent 
clinic where a regular diagnostic job on a university hospital level 
could be done. Study of more neutral humain traits could be easily 
incorporated, and in this the human biologist would eagerly share. 

The analysis of excess mortality linked to psychoses deserves atten- 
tion. It is here calculated in terms of the remaining mean expectation 
of life. It has been again found that a psychosis which is not connected 
with frank somatic pathology shortens life significantly, although it 
does not kill directly. A few figures give a hint on biological forces 
which may make people move from their homes: about one quarter of 
the psychotics born in the area were not residing there at the time of 
the onset of their illness, which appears to be well in excess of the 
emigration rate. It can be inferred that certain aspects of the pre-morbid 
personality made easier the decision to migrate. This would tie in 
with Odegaard’s finding of a higher incidence of schizophrenia in 


Norwegian-born persons in Minnesota than in the population of Norway. 
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There is a great deal of thinking, experience and honest labor behind 


the many tables and the lengthy explanations of the procedure in the 


first volume of the monograph. The thoroughness of ascertainment of 
this study is in keeping with the tradition of Scandinavian population 
studies. The second volume probably will bring more in the way of 


general conclusions, and will be eagerly awaited. 


KAREL PLANANSKY 
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